Neural Architecture Search for Low-Precision Neural Networks

Research output: Contribution to book/conference proceedings/anthology/reportConference contributionContributedpeer-review

Contributors

Abstract

In our work, we extend the search space of the differentiable Neural Architecture Search (NAS) by adding bitwidth. The extended NAS algorithm is performed directly with low-precision from scratch without the proxy of full-precision. With our low-precision NAS, we can search for low- and mixed-precision network architectures of Convolutional Neural Networks (CNNs) under specific constraints, such as power consumption. Experiments on the ImageNet dataset demonstrate the effectiveness of our method, where the searched models achieve better accuracy (up to 1.2 percentage point) with smaller model sizes (up to 27 % smaller) and lower power consumption (up to 27 % lower) compared to the state-of-art methods. In our low-precision NAS, sharing of convolution is developed to speed up training and decrease memory consumption. Compared to the FBNet-V2 implementation, our solution reduces training time and memory cost by nearly 3 × and 2 ×, respectively. Furthermore, we adapt the NAS to train the entire supernet instead of a subnet in each iteration to address the insufficient training issue. Besides, we also propose the forward-and-backward scaling method, which addresses the issue by eliminating the vanishing of the forward activations and backward gradients.

Details

Original languageEnglish
Title of host publicationArtificial Neural Networks and Machine Learning - ICANN 2022
EditorsElias Pimenidis, Mehmet Aydin, Plamen Angelov, Chrisina Jayne, Antonios Papaleonidas
PublisherSpringer Science and Business Media B.V.
Pages743-755
Number of pages13
ISBN (electronic)978-3-031-15937-4
ISBN (print)978-3-031-15936-7
Publication statusPublished - 2022
Peer-reviewedYes

Publication series

SeriesLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13532 LNCS
ISSN0302-9743

Conference

Title31st International Conference on Artificial Neural Networks, ICANN 2022
Duration6 - 9 September 2022
CityBristol
CountryUnited Kingdom

Keywords

Keywords

  • Convolutional Neural Network, Low- and mixed-precision, Neural Architecture Search