Jump to content

Neural Network Compression: Difference between revisions

Line 11: Line 11:
# Compute sensitivity for each parameter.
# Compute sensitivity for each parameter.
# Delete low-saliency parameters.
# Delete low-saliency parameters.
# Continue training and repeat pruning until the number of parameters is low enough or error is too high.
# Continue training to fine-tune remaining parameters.
# Repeat pruning until the number of parameters is low enough or the error is too high.


Sometimes, pruning can also increase accuracy and improve generalization.
Sometimes, pruning can also increase accuracy and improve generalization.