5,321
edits
Line 35: | Line 35: | ||
They use 32 kernels per block at the coarsest scale and increase <math>2 \times</math> every 4 scales.<br> | They use 32 kernels per block at the coarsest scale and increase <math>2 \times</math> every 4 scales.<br> | ||
===[https://arxiv.org/pdf/1502.03167.pdf Batch Normalization]=== | ====[https://arxiv.org/pdf/1502.03167.pdf Batch Normalization]==== | ||
; Definitions: | ; Definitions: | ||
* Internal Covariate Shift - the change in distribution of network activations as network parameters change. | * Internal Covariate Shift - the change in distribution of network activations as network parameters change. |