CNN Filter Explorer · ai-explained

Why it matters

Convolutional layers are the feature detectors of vision models. The same 3×3 weight matrix slides over every position, sharing parameters and reducing them from O(HW) to O(k²). Early layers detect edges, middle layers detect textures, deep layers detect object parts — the hierarchy emerges from stacking convolutions, not from designing the features. ViTs replaced convolutions with patches, but the intuition carries forward: local structure first, global reasoning later.

The math

(I * K)[i,j] = Σₘ Σₙ I[i+m, j+n] · K[m,n]   # convolution

ReLU(x) = max(0, x)                           # non-linearity

Output size = (H − k + 1) × (W − k + 1)      # no padding

Anchored to 03-neural-networks/architectures-cnn-rnn.

CNN filters — the sliding dot product

Why it matters

The math