No edit summary |
|||
Line 1: | Line 1: | ||
Machine Learning | Machine Learning | ||
==Loss functions== | ==Loss functions== | ||
Line 57: | Line 55: | ||
update using above gradient | update using above gradient | ||
</pre> | </pre> | ||
;Batch Size | |||
* [https://medium.com/mini-distill/effect-of-batch-size-on-training-dynamics-21c14f7a716e A Medium post empirically evaluating the effect of batch size on training dynamics] | |||
===Coordinate Block Descent=== | ===Coordinate Block Descent=== | ||