1x1 convolution layer classifies every feature vector of the feature map
Limitation : Low resolution predicted score map
→ We use upsampling