Convolution / Kernel / Feature map dimension

Created	@April 23, 2022
Tags	CNN

15) Describe how convolution works. What about if your inputs are grayscale vs RGB imagery? What determines the shape of the next layer? [src]

https://dev.to/sandeepbalachandran/machine-learning-convolution-with-color-images-2p41

Convolution is a mathematical operation trying to learn the values of filter(s) using backprop, where we have an input I, and an argument, kernel K to produce an output that expresses how the shape of one is modified by another.

Convolutional layer is core building block of CNN, it helps with feature detection.

Kernel K is a set of learnable filters and is small spatially compared to the image but extends through the full depth of the input image.

Dimension of the feature map as a function of the input image size(W), feature detector size(F), Stride(S) and Zero Padding on image(P) is (W−F+2P)/S+1

No. of parameters = (Kernel size * Kernel size * Dimension )+1 = 28