Batch normalization: Difference between revisions

no edit summary
No edit summary
 
Line 7: Line 7:
* An average mean.
* An average mean.
* An average std dev.
* An average std dev.
For CNNs each of these is a vector the size of the number of channels.


During training, these two values are computed from the batch.
During training, these two values are computed from the batch.