Weight Sparsity Complements Activity Sparsity in Neuromorphic Language Models

Research output: Contribution to book/conference proceedings/anthology/report; conference contribution, contributed, peer-reviewed

Abstract

Activity and parameter sparsity are two standard methods for making neural networks computationally more efficient. Event-based architectures such as spiking neural networks (SNNs) naturally exhibit activity sparsity, and many methods exist to sparsify their connectivity by pruning weights. While the effect of weight pruning on feed-forward SNNs has previously been studied for computer vision tasks, its effect on complex sequence tasks such as language modeling is less well studied, since SNNs have traditionally struggled to achieve meaningful performance on these tasks. Using a recently published SNN-like architecture that works well on small-scale language modeling, we study the effects of weight pruning when combined with activity sparsity. Specifically, we examine the tradeoff between the multiplicative efficiency gains the combination affords and its effect on language modeling performance. To dissect the effects of the two forms of sparsity, we conduct a comparative analysis between densely activated models and sparsely activated event-based models across varying degrees of connectivity sparsity. We demonstrate that sparse activity and sparse connectivity complement each other without a proportional drop in task performance for an event-based neural network trained on the Penn Treebank and WikiText-2 language modeling datasets. Our results suggest that sparsely connected event-based neural networks are promising candidates for effective and efficient sequence modeling.
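To make the interplay of the two forms of sparsity concrete, the following minimal sketch shows how magnitude-based weight pruning and event-based (sparse) activations combine multiplicatively in the per-step cost of a recurrent layer. This is not the authors' implementation: the 10% weight density, the ~10% activity rate, the layer size, and all names (`W`, `weight_mask`, `activity`, `effective_macs`) are illustrative assumptions.

```python
import numpy as np

# Minimal sketch (not the paper's code): magnitude pruning of a recurrent
# weight matrix combined with event-based (sparse) activations.
# All sizes and sparsity levels below are illustrative assumptions.

rng = np.random.default_rng(0)

hidden = 256
W = rng.normal(size=(hidden, hidden))          # dense recurrent weights

# Weight sparsity: keep only the largest-magnitude 10% of connections.
keep_fraction = 0.10
threshold = np.quantile(np.abs(W), 1.0 - keep_fraction)
weight_mask = np.abs(W) >= threshold
W_pruned = W * weight_mask

# Activity sparsity: an event-based layer emits events from only a small
# fraction of units per step (here ~10% of units are active).
activity = (rng.random(hidden) < 0.10).astype(float)

# Only connections whose presynaptic unit fired AND whose weight survived
# pruning contribute a multiply-accumulate, so the two sparsities multiply.
effective_macs = int(weight_mask[:, activity > 0].sum())
dense_macs = hidden * hidden
print(f"MACs: {effective_macs} of {dense_macs} "
      f"({effective_macs / dense_macs:.1%} of dense cost)")
```

In expectation, the surviving multiply-accumulates scale with the product of weight density and activity density (here roughly 1% of the dense cost), which is the multiplicative efficiency gain referred to in the abstract.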

Details

Original language: English
Title of host publication: Proceedings - 2024 International Conference on Neuromorphic Systems, ICONS 2024
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Pages: 132-139
Number of pages: 8
ISBN (electronic): 979-8-3503-6865-9
Publication status: E-pub ahead of print - 2 Dec 2024
Peer-reviewed: Yes

Conference

Title: 2024 International Conference on Neuromorphic Systems
Abbreviated title: ICONS 2024
Duration: 30 July - 2 August 2024
Location: George Mason University & Online
City: Arlington
Country: United States of America

External IDs

ORCID: /0000-0001-8525-8702/work/191532878

Keywords

  • Event-based neural networks, language modeling, machine learning, pruning, recurrent neural networks, sparsity