Neural Network Compression

Revision as of 20:36, 2 February 2021 by David (talk | contribs) (David moved page Private:Neural Network Compression to Neural Network Compression over redirect)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Brief survey on neural network compression techniques.

Pruning

Sensitivity Methods

The idea here is to measure how sensitive each neuron is.
I.e., if you remove the neuron, how will it change the output?

Factorization

Resources

Surveys

Pruning algorithms a survey (1993) by Russel Reed
A Survey of Model Compression and Acceleration for Deep Neural Networks (2017) by Cheng et al.

Retrieved from "https://wiki.davidl.me/index.php?title=Neural_Network_Compression&oldid=5046"