Uni Notes

8. Convolutional NN

Good for graph-structured data. An image is a very regularly structured graph (a grid of pixels).

Goals:

Want to predict the same label even when the image is translated (shifted) by a few pixels
- Invariance to translation
Want a segmentation (per-pixel labeling) to be translated along with the input image
- Equivariance to translation

How?

Augment with translated images
Special regularization
Invariance built into pre-processing
Invariance built into the structure of the NN ($⟹$ convolutional NN)
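The first option (augmenting with translated images) can be sketched in a few lines of plain Python. This is only an illustration: the helper names `translate` and `augment`, the zero fill value, and the list-of-rows image format are assumptions, not something from the lecture.

```python
def translate(image, dy, dx, fill=0):
    """Shift a 2D image (list of rows) by dy rows down and dx columns right.

    Pixels shifted out of the frame are dropped; vacated pixels get `fill`.
    """
    h, w = len(image), len(image[0])
    out = [[fill] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            nr, nc = r + dy, c + dx
            if 0 <= nr < h and 0 <= nc < w:
                out[nr][nc] = image[r][c]
    return out


def augment(image, label, max_shift=1):
    """Return (shifted image, unchanged label) pairs for all shifts up to max_shift."""
    shifts = range(-max_shift, max_shift + 1)
    return [(translate(image, dy, dx), label) for dy in shifts for dx in shifts]
```

Every shifted copy keeps the original label, which is what pushes the trained model toward translation invariance.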

Convolution Layer

Each unit depends only on “close-by” inputs (e.g. neighboring pixels), so connections are sparse
Weights are reused across positions (weight sharing)
Convolution: weighted sum of nearby pixels
Key idea: learn the filters

$$z_i = \sum_{j = \max(1,\, i - d + 1)}^{\min(i,\, k)} w_j \, x_{i - j + 1}$$

Figure: (c) with padding, (d) without padding

Ex.: $w = [-1, 1]$ highlights edges. With the sum above, $[0, 1, 1] ⇝ [0, -1, 0, 1]$: the output is nonzero only at the step ($z_2$) and where the signal ends ($z_4$).
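The convolution sum can be checked with a small plain-Python sketch. It assumes, as the summation bounds suggest, that the input $x$ has length $d$ and the filter $w$ has length $k$, so the output has $d + k - 1$ entries; the function name `conv1d_full` is made up for illustration.

```python
def conv1d_full(x, w):
    """Full 1D convolution: z_i = sum_j w_j * x_{i-j+1} (1-based indices).

    x has length d, w has length k, and the output has length d + k - 1.
    """
    d, k = len(x), len(w)
    z = []
    for i in range(1, d + k):  # i = 1, ..., d + k - 1
        lo, hi = max(1, i - d + 1), min(i, k)
        # Shift from the 1-based math indices to 0-based Python indices.
        z.append(sum(w[j - 1] * x[i - j] for j in range(lo, hi + 1)))
    return z


print(conv1d_full([0, 1, 1], [-1, 1]))  # [0, -1, 0, 1]
print(conv1d_full([1, 1, 1], [-1, 1]))  # [-1, 0, 0, 1]
```

With this index convention the step in `[0, 1, 1]` shows up with a negative sign, and the first and last entries are boundary effects: a constant signal only responds where it starts and ends.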

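$$\begin{pmatrix} z_1 \\ z_2 \\ z_3 \\ z_4 \end{pmatrix} = \begin{pmatrix} w_1 & 0 & 0 \\ w_2 & w_1 & 0 \\ 0 & w_2 & w_1 \\ 0 & 0 & w_2 \end{pmatrix} \begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix}$$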
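The matrix view can be sketched the same way: a convolution is a matrix-vector product with a banded matrix whose rows reuse the same few weights. The sketch assumes the layout implied by the sum formula, i.e. entry $(i, c)$ equals $w_{i - c + 1}$ and is zero outside the filter; `conv_matrix` and `matvec` are hypothetical helper names.

```python
def conv_matrix(w, d):
    """Build the (d + k - 1) x d matrix A with A[i][c] = w_{i-c+1}, zero elsewhere."""
    k = len(w)
    rows = d + k - 1
    return [[w[i - c] if 0 <= i - c < k else 0 for c in range(d)]
            for i in range(rows)]


def matvec(a, x):
    """Plain matrix-vector product for a list-of-rows matrix."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in a]


A = conv_matrix([-1, 1], d=3)
for row in A:
    print(row)
print(matvec(A, [0, 1, 1]))  # [0, -1, 0, 1]
```

Most entries of the matrix are zero and the nonzero ones are copies of only $k$ distinct weights, which is the sparsity and weight reuse of a convolution layer.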

Stride: skipping pixels in a convolutional layer in regular steps
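A minimal sketch of a strided 1D convolution without padding follows. For simplicity it uses the cross-correlation form (the filter is not flipped), which is an assumption on top of the notes; `conv1d_strided` is an illustrative name.

```python
def conv1d_strided(x, w, stride=1):
    """Slide w over x without padding, moving `stride` positions per step.

    Each output is a weighted sum of len(w) neighboring inputs
    (cross-correlation form, i.e. the filter is not flipped).
    """
    f = len(w)
    return [sum(w[j] * x[start + j] for j in range(f))
            for start in range(0, len(x) - f + 1, stride)]


x = [1, 2, 4, 7, 11, 16]
print(conv1d_strided(x, [-1, 1], stride=1))  # [1, 2, 3, 4, 5]
print(conv1d_strided(x, [-1, 1], stride=2))  # [1, 3, 5]
```

With stride 2 every second window is skipped, so the output is roughly half as long.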

Computing output dimensions:

Applying $m$ different $f \times f$ filters (convolving with an $f \times f \times m$ tensor)
To an $n \times n$ image
With padding $p$
And stride $s$
$⇝$ an $l \times l \times m$ tensor with $l = \left\lfloor \frac{n + 2 p - f}{s} \right\rfloor + 1$
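The output-size formula is easy to sanity-check in code. The sketch below rounds down when $n + 2p - f$ is not divisible by $s$, which is the usual convention but an assumption relative to the notes; the function names are illustrative.

```python
def conv_output_size(n, f, p=0, s=1):
    """Side length l of the output for an n x n input, f x f filters, padding p, stride s."""
    if n + 2 * p < f:
        raise ValueError("filter is larger than the padded input")
    return (n + 2 * p - f) // s + 1


def conv_output_shape(n, f, m, p=0, s=1):
    """Shape (l, l, m) of the output tensor when m filters are applied."""
    l = conv_output_size(n, f, p, s)
    return (l, l, m)


print(conv_output_shape(32, 5, 6))           # (28, 28, 6)
print(conv_output_size(32, 3, p=1))          # 32: padding 1 keeps the size for 3 x 3 filters
print(conv_output_size(224, 7, p=3, s=2))    # 112
```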

The number of inputs affecting each unit (its receptive field) grows with every hidden layer.
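How quickly this region grows layer by layer can be computed with a short sketch. It assumes a plain stack of convolution or pooling layers, each described by a hypothetical `(filter_size, stride)` pair, and counts inputs along one spatial axis.

```python
def receptive_field(layers):
    """Receptive field (along one axis) of a unit after a stack of layers.

    `layers` is a list of (filter_size, stride) pairs, first layer first.
    """
    r, jump = 1, 1  # jump = distance in input pixels between neighboring units
    for f, s in layers:
        r += (f - 1) * jump
        jump *= s
    return r


print(receptive_field([(3, 1)]))                   # 3
print(receptive_field([(3, 1)] * 3))               # 7
print(receptive_field([(3, 1), (2, 2), (3, 1)]))   # 8
```

Three stacked $3 \times 3$ layers already see a $7 \times 7$ patch, and strides (or pooling) make later layers grow the region faster.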

Pooling Layers (Subsampling)

Aggregate neighboring entries
Consider either the average or maximum value of a group of neighboring pixels
- Average pooling is just another convolution (with fixed, uniform weights)
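A minimal 1D sketch of both pooling variants, assuming non-overlapping groups (stride equal to the group size). It also shows average pooling written as a strided weighted sum with uniform weights $1/\text{size}$; all function names are illustrative.

```python
def pool1d(x, size, reduce):
    """Aggregate non-overlapping groups of `size` neighboring entries with `reduce`."""
    return [reduce(x[i:i + size]) for i in range(0, len(x) - size + 1, size)]


def mean(values):
    return sum(values) / len(values)


def avg_pool_as_conv(x, size):
    """Average pooling written as a strided weighted sum with uniform weights 1/size."""
    w = [1 / size] * size
    return [sum(w_j * x_j for w_j, x_j in zip(w, x[i:i + size]))
            for i in range(0, len(x) - size + 1, size)]


x = [1, 3, 2, 8, 5, 5]
print(pool1d(x, 2, max))       # [3, 8, 5]
print(pool1d(x, 2, mean))      # [2.0, 5.0, 5.0]
print(avg_pool_as_conv(x, 2))  # [2.0, 5.0, 5.0]
```

Max pooling has no such fixed-weight form, since the maximum is not a linear function of the inputs.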

Common setup
