[CNN] architecture
CNN (= ConvNet)
- a sequence of layers
- each layer of a ConvNet transforms one volume of activations to another through a differentiable function
- one volume of activations = activation map = feature map
ReLU (nonlinear) layer : activates relevant responses (zeroes out negative values)
Fully-Connected Layer : each neuron in a layer will be connected to all the numbers in the previous volume
Pooling Layer : downsampling operation layer
Convolutional Layer : specially designed for ConvNet
* Multi-Layer Perceptron : fully-connected layer(s) + activation function (ReLU, sigmoid, ...)
* Difference between a Multi-Layer Perceptron and a CNN : pooling and convolutional layers are added
[(Conv-ReLU) * N - Pool ] * M - (FC -ReLU) * K - SoftMax
(N usually ~5, M > 10, 0 <= K <= 2)
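The layer ordering above can be sketched as a small generator. `build_pattern` is a hypothetical helper name chosen here, not a library function:

```python
def build_pattern(N, M, K):
    """Expand [(Conv-ReLU) * N - Pool] * M - (FC-ReLU) * K - SoftMax
    into a flat list of layer names."""
    layers = []
    for _ in range(M):
        for _ in range(N):
            layers += ["Conv", "ReLU"]
        layers.append("Pool")
    for _ in range(K):
        layers += ["FC", "ReLU"]
    layers.append("SoftMax")
    return layers

print(build_pattern(N=2, M=1, K=1))
# -> ['Conv', 'ReLU', 'Conv', 'ReLU', 'Pool', 'FC', 'ReLU', 'SoftMax']
```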
Convolutional Layer vs FC layer
CNN architecture
1. kernel
: filter
: output size = i - k + 1 (i : input size, k : kernel size)
2. stride
: the step size by which the filter moves across the input
: output size = (i - k)/s + 1 (s : stride)
3. padding
: output size = (i - k + 2p)/s + 1 (p : padding)
4. pooling
: generalizing features extracted by convolutional layers
5. flatten
: 2D array to single long continuous linear array
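The output-size formulas from items 1-3 can be checked with a small helper (a sketch; `conv_output_size` is a name chosen here):

```python
def conv_output_size(i, k, s=1, p=0):
    """Spatial output size of a conv layer: (i - k + 2p) / s + 1.
    i : input size, k : kernel size, s : stride, p : padding.
    Integer division assumes the setting divides evenly."""
    return (i - k + 2 * p) // s + 1

print(conv_output_size(32, 5))             # -> 28  (no padding, stride 1: i - k + 1)
print(conv_output_size(7, 3, s=2))         # -> 3   (stride 2)
print(conv_output_size(32, 5, s=1, p=2))   # -> 32  ("same" padding preserves size)
```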
Layers
1. Convolutional layer
: extract features from an image
2. Pooling layer
: decrease size of the convolved feature map
: Max pooling / Average pooling
3. Fully Connected (FC) layer
: weights and biases
: last few layers of CNN architecture
: connect neurons between different layers
4. Dropout
: mask
: nullify the contribution of some neurons towards the next layer
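A minimal pure-Python sketch of max pooling, flatten, and a dropout mask from the list above. Function names are illustrative, and the dropout uses inverted scaling by 1/(1-p), an assumption about the variant meant here:

```python
import random

def max_pool2d(x, size=2, stride=2):
    """Downsample a 2D feature map by taking the max of each window."""
    h, w = len(x), len(x[0])
    out = []
    for r in range(0, h - size + 1, stride):
        row = []
        for c in range(0, w - size + 1, stride):
            row.append(max(x[r + dr][c + dc]
                           for dr in range(size) for dc in range(size)))
        out.append(row)
    return out

def flatten(x):
    """2D array -> single long continuous linear array."""
    return [v for row in x for v in row]

def dropout(values, p=0.5, rng=None):
    """Mask: nullify each neuron with probability p, scale survivors by 1/(1-p)."""
    rng = rng or random.Random(0)  # fixed seed only to make the sketch reproducible
    return [0.0 if rng.random() < p else v / (1.0 - p) for v in values]

fmap = [[1, 3, 2, 4],
        [5, 6, 1, 2],
        [7, 2, 9, 1],
        [3, 4, 0, 8]]
print(max_pool2d(fmap))            # -> [[6, 4], [7, 9]]
print(flatten(max_pool2d(fmap)))   # -> [6, 4, 7, 9]
```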
Activation function
: determine whether the neuron should be activated or not
: Sigmoid, tanh, Softmax, ReLU
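The listed activation functions, sketched in plain Python:

```python
import math

def sigmoid(x):
    # squashes any real input into (0, 1)
    return 1.0 / (1.0 + math.exp(-x))

def tanh(x):
    # squashes any real input into (-1, 1), zero-centered
    return math.tanh(x)

def relu(x):
    # passes positive inputs through, zeroes out negatives
    return max(0.0, x)

def softmax(xs):
    # turns a vector of scores into probabilities that sum to 1;
    # subtracting the max is a standard numerical-stability trick
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

print(sigmoid(0.0))          # -> 0.5
print(relu(-2.0))            # -> 0.0
print(softmax([0.0, 0.0]))   # -> [0.5, 0.5]
```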