===Kernel===
Typically these days people use small kernels, e.g. \(3 \times 3\) or \(4 \times 4\), stacked across many conv layers.
However, historically people used larger kernels (e.g. <math>7 \times 7</math>). A larger kernel has more parameters to train per layer, which limited how deep these networks could be.
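To make the parameter trade-off concrete, the following minimal sketch compares one \(7 \times 7\) conv layer against three stacked \(3 \times 3\) conv layers, which cover the same \(7 \times 7\) receptive field at stride 1. The helper names `conv_params` and `receptive_field` and the choice of 64 channels are assumptions made for illustration, not part of the original notes.

```python
def conv_params(kernel_size, in_channels, out_channels, bias=True):
    """Trainable parameters in one 2D conv layer with a square kernel."""
    weights = kernel_size * kernel_size * in_channels * out_channels
    return weights + (out_channels if bias else 0)


def receptive_field(kernel_size, num_layers):
    """Receptive field of num_layers stacked stride-1 convs of equal kernel size."""
    return 1 + num_layers * (kernel_size - 1)


channels = 64  # assumed channel count, chosen only for illustration

# One large layer versus three small layers with the same receptive field.
one_large = conv_params(7, channels, channels)
three_small = 3 * conv_params(3, channels, channels)

print("receptive field, one 7x7:", receptive_field(7, 1))
print("receptive field, three 3x3:", receptive_field(3, 3))
print("parameters, one 7x7:", one_large)
print("parameters, three 3x3:", three_small)
```

Under these assumptions the stack of small kernels uses roughly half the parameters of the single large kernel while also adding two extra nonlinearities between layers.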
Note that in practice, inputs have multiple channels, so the actual kernel is 3D: it spans all input channels as well as the two spatial dimensions.
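As a rough illustration of a 3D kernel, the sketch below slides a single kernel of shape (channels, height, width) over a multi-channel input and sums over all three axes at each position. It assumes stride 1, no padding, and no bias; the function name `conv2d_single_filter` and the toy inputs are hypothetical, and real libraries implement this far more efficiently.

```python
def conv2d_single_filter(image, kernel):
    """Valid cross-correlation of a multi-channel image with one 3D kernel.

    image:  nested lists of shape (channels, height, width)
    kernel: nested lists of shape (channels, kernel_h, kernel_w)
    Returns a 2D feature map as a list of rows.
    """
    channels = len(image)
    height, width = len(image[0]), len(image[0][0])
    kernel_h, kernel_w = len(kernel[0]), len(kernel[0][0])
    if len(kernel) != channels:
        raise ValueError("kernel depth must match the number of input channels")

    output = []
    for i in range(height - kernel_h + 1):
        row = []
        for j in range(width - kernel_w + 1):
            total = 0.0
            # Sum over channels and both spatial axes of the kernel.
            for c in range(channels):
                for di in range(kernel_h):
                    for dj in range(kernel_w):
                        total += image[c][i + di][j + dj] * kernel[c][di][dj]
            row.append(total)
        output.append(row)
    return output


# Toy example: a 3-channel 5x5 input of ones and a 3x3x3 kernel of ones.
image = [[[1.0] * 5 for _ in range(5)] for _ in range(3)]
kernel = [[[1.0] * 3 for _ in range(3)] for _ in range(3)]
feature_map = conv2d_single_filter(image, kernel)
print(len(feature_map), len(feature_map[0]))
```

Each output value here combines \(3 \times 3 \times 3 = 27\) input values, so one 3D kernel still produces a single 2D feature map; a conv layer with several output channels would use one such kernel per output channel.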