FVLLMONTI: The 3D Neural Network Compute Cube (N2C2) Concept for Efficient Transformer Architectures Towards Speech-to-Speech Translation

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

  • Ian O'Connor - , Lyon Institute Of Nanotechnology (Author)
  • Sara Mannaa - , Lyon Institute Of Nanotechnology (Author)
  • Alberto Bosio - , Lyon Institute Of Nanotechnology (Author)
  • Bastien Deveautour - , Lyon Institute Of Nanotechnology (Author)
  • Damien Deleruyelle - , Lyon Institute Of Nanotechnology (Author)
  • Tetiana Obukhova - , Lyon Institute Of Nanotechnology (Author)
  • Cédric Marchand - , Lyon Institute Of Nanotechnology (Author)
  • Jens Trommer - , NaMLab - Nanoelectronic materials laboratory gGmbH (Author)
  • Cigdem Cakirlar - , NaMLab - Nanoelectronic materials laboratory gGmbH (Author)
  • Bruno Neckel Wesling - , NaMLab - Nanoelectronic materials laboratory gGmbH (Author)
  • Thomas Mikolajick - , NaMLab - Nanoelectronic materials laboratory gGmbH (Author)
  • Oskar Baumgartner - , Global TCAD Solutions GmbH (Author)
  • Mischa Thesberg - , Global TCAD Solutions GmbH (Author)
  • David Pirker - , Global TCAD Solutions GmbH (Author)
  • Christoph Lenz - , Global TCAD Solutions GmbH (Author)
  • Zlatan Stanojevic - , Global TCAD Solutions GmbH (Author)
  • Markus Karner - , Global TCAD Solutions GmbH (Author)
  • Guilhem Larrieu - , Laboratoire d'Analyse et d'Architecture des Systemes (Author)
  • Sylvain Pelloquin - , Laboratoire d'Analyse et d'Architecture des Systemes (Author)
  • Konstantinous Moustakas - , Laboratoire d'Analyse et d'Architecture des Systemes (Author)
  • Jonas Muller - , Laboratoire d'Analyse et d'Architecture des Systemes (Author)
  • Giovanni Ansaloni - , Swiss Federal Institute of Technology Lausanne (EPFL) (Author)
  • Alireza Amirshahi - , Swiss Federal Institute of Technology Lausanne (EPFL) (Author)
  • David Atienza - , Swiss Federal Institute of Technology Lausanne (EPFL) (Author)
  • Jean Luc Rouas - , Université de Bordeaux (Author)
  • Leila Ben Letaifa - , Université de Bordeaux (Author)
  • Georgeta Bordeall - , Université de Bordeaux (Author)
  • Charles Brazier - , Université de Bordeaux (Author)
  • Chhandak Mukherjee - , Université de Bordeaux (Author)
  • Marina Deng - , Université de Bordeaux (Author)
  • Yifan Wang - , Université de Bordeaux (Author)
  • Marc Francois - , Université de Bordeaux (Author)
  • Houssem Rezgui - , Université de Bordeaux (Author)
  • Reveil Lucas - , Université de Bordeaux (Author)
  • Cristell Maneux - , Université de Bordeaux (Author)

Abstract

This multi-partner-project contribution introduces the midway results of the Horizon 2020 FVLLMONTI project. In this project we develop a new and ultra-efficient class of ANN accelerators, the neural network compute cube (N2C2), which is specifically designed to execute complex machine learning tasks in a 3D technology, in order to provide the high computing power and ultra-high efficiency needed for future edgeAI applications. We showcase its effectiveness by targeting the challenging class of Transformer ANNs, tailored for Automatic Speech Recognition and Machine Translation, the two fundamental components of speech-to-speech translation. To gain the full benefit of the accelerator design, we develop disruptive vertical transistor technologies and execute design-technology-co-optimization (DTCO) loops from single device, to cell and compute cube level. Further, a hardware-software-co-optimization is executed, e.g. by compressing the executed speech recognition and translation models for energy efficient executing without substantial loss in precision.

Details

Original languageEnglish
Title of host publication2024 Design, Automation and Test in Europe Conference and Exhibition, DATE 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
ISBN (electronic)9798350348590, 978-3-9819263-8-5
Publication statusPublished - 2024
Peer-reviewedYes
Externally publishedYes

Publication series

SeriesProceedings -Design, Automation and Test in Europe, DATE
ISSN1530-1591

Conference

Title2024 Design, Automation and Test in Europe Conference and Exhibition
Abbreviated titleDATE 2024
Conference number27
Duration25 - 27 March 2024
Website
LocationPalacio De Congresos De Valencia
CityValencia
CountrySpain

External IDs

ORCID /0000-0003-3814-0378/work/163295411
Scopus 85196503324

Keywords

ASJC Scopus subject areas

Keywords

  • ANN, DTCO, emerging technologies, translation