5,337
edits
Line 11: | Line 11: | ||
===Batch Size=== | ===Batch Size=== | ||
[https://medium.com/mini-distill/effect-of-batch-size-on-training-dynamics-21c14f7a716e A medium post empirically evaluating the effect of batch_size] | [https://medium.com/mini-distill/effect-of-batch-size-on-training-dynamics-21c14f7a716e A medium post empirically evaluating the effect of batch_size] | ||
===Learning Rate=== |