Magnitude and Uncertainty Pruning Criterion for Neural Networks

Ko; Vinnie;, Oehmcke; Stefan;, Gieseke Fabian

Magnitude and Uncertainty Pruning Criterion for Neural Networks

Ko, Vinnie; Oehmcke, Stefan; Gieseke Fabian

Abstract

Neural networks have achieved dramatic improvements in recent years and depict the state-of-the-art methods for many real-world tasks nowadays. One drawback is, however, that many of these models are overparameterized, which makes them both computationally and memory intensive. Furthermore, overparameterization can also lead to undesired overfitting side-effects. Inspired by recently proposed magnitude-based pruning schemes and the Wald test from the field of statistics, we introduce a novel magnitude and uncertainty (M&U) pruning criterion that helps to lessen such shortcomings. One important advantage of our M&U pruning criterion is that it is scale-invariant, a phenomenon that the magnitude-based pruning criterion suffers from. In addition, we present a ``pseudo bootstrap'' scheme, which can efficiently estimate the uncertainty of the weights by using their update information during training. Our experimental evaluation, which is based on various neural network architectures and datasets, shows that our new criterion leads to more compressed models compared to models that are solely based on magnitude-based pruning criteria, with, at the same time, less loss in predictive power.

Keywords
neural networks; pruning

Publication type

Research article in proceedings (conference)

Peer reviewed

Yes

Publication status

Published

Year

2019

Conference

IEEE Big Data, Intelligent Data Mining Special Session

Venue

Los Angeles

Book title

2019 {IEEE} International Conference on Big Data {(IEEE} BigData)

Editor

Baru, Chaitanya K.; Huan, Jun; Khan, Latifur; Hu, Xiaohua; Ak, Ronay; Tian, Yuanyuan; Barga, Roger S.; Zaniolo, Carlo; Lee, Kisung; Ye, Yanfang (Fanny)

Start page

2317

End page

2326

Publisher

IEEE

Place

Los Angeles, USA

Language

English

DOI

https://doi.org/10.1109/BigData47090.2019.9005692

Full text

https://doi.org/