FVLLMONTI: The 3D Neural Network Compute Cube (N2C2) Concept for Efficient Transformer Architectures Towards Speech-to-Speech Translation

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/GutachtenBeitrag in KonferenzbandBeigetragenBegutachtung

Beitragende

  • Ian O'Connor - , Institut des Nanotechnologies de Lyon (Autor:in)
  • Sara Mannaa - , Institut des Nanotechnologies de Lyon (Autor:in)
  • Alberto Bosio - , Institut des Nanotechnologies de Lyon (Autor:in)
  • Bastien Deveautour - , Institut des Nanotechnologies de Lyon (Autor:in)
  • Damien Deleruyelle - , Institut des Nanotechnologies de Lyon (Autor:in)
  • Tetiana Obukhova - , Institut des Nanotechnologies de Lyon (Autor:in)
  • Cédric Marchand - , Institut des Nanotechnologies de Lyon (Autor:in)
  • Jens Trommer - , NaMLab - Nanoelectronic materials laboratory gGmbH (Autor:in)
  • Cigdem Cakirlar - , NaMLab - Nanoelectronic materials laboratory gGmbH (Autor:in)
  • Bruno Neckel Wesling - , NaMLab - Nanoelectronic materials laboratory gGmbH (Autor:in)
  • Thomas Mikolajick - , NaMLab - Nanoelectronic materials laboratory gGmbH (Autor:in)
  • Oskar Baumgartner - , Global TCAD Solutions GmbH (Autor:in)
  • Mischa Thesberg - , Global TCAD Solutions GmbH (Autor:in)
  • David Pirker - , Global TCAD Solutions GmbH (Autor:in)
  • Christoph Lenz - , Global TCAD Solutions GmbH (Autor:in)
  • Zlatan Stanojevic - , Global TCAD Solutions GmbH (Autor:in)
  • Markus Karner - , Global TCAD Solutions GmbH (Autor:in)
  • Guilhem Larrieu - , Laboratoire d'Analyse et d'Architecture des Systemes (Autor:in)
  • Sylvain Pelloquin - , Laboratoire d'Analyse et d'Architecture des Systemes (Autor:in)
  • Konstantinous Moustakas - , Laboratoire d'Analyse et d'Architecture des Systemes (Autor:in)
  • Jonas Muller - , Laboratoire d'Analyse et d'Architecture des Systemes (Autor:in)
  • Giovanni Ansaloni - , École Polytechnique Fédérale de Lausanne (Autor:in)
  • Alireza Amirshahi - , École Polytechnique Fédérale de Lausanne (Autor:in)
  • David Atienza - , École Polytechnique Fédérale de Lausanne (Autor:in)
  • Jean Luc Rouas - , Université de Bordeaux (Autor:in)
  • Leila Ben Letaifa - , Université de Bordeaux (Autor:in)
  • Georgeta Bordeall - , Université de Bordeaux (Autor:in)
  • Charles Brazier - , Université de Bordeaux (Autor:in)
  • Chhandak Mukherjee - , Université de Bordeaux (Autor:in)
  • Marina Deng - , Université de Bordeaux (Autor:in)
  • Yifan Wang - , Université de Bordeaux (Autor:in)
  • Marc Francois - , Université de Bordeaux (Autor:in)
  • Houssem Rezgui - , Université de Bordeaux (Autor:in)
  • Reveil Lucas - , Université de Bordeaux (Autor:in)
  • Cristell Maneux - , Université de Bordeaux (Autor:in)

Abstract

This multi-partner-project contribution introduces the midway results of the Horizon 2020 FVLLMONTI project. In this project we develop a new and ultra-efficient class of ANN accelerators, the neural network compute cube (N2C2), which is specifically designed to execute complex machine learning tasks in a 3D technology, in order to provide the high computing power and ultra-high efficiency needed for future edgeAI applications. We showcase its effectiveness by targeting the challenging class of Transformer ANNs, tailored for Automatic Speech Recognition and Machine Translation, the two fundamental components of speech-to-speech translation. To gain the full benefit of the accelerator design, we develop disruptive vertical transistor technologies and execute design-technology-co-optimization (DTCO) loops from single device, to cell and compute cube level. Further, a hardware-software-co-optimization is executed, e.g. by compressing the executed speech recognition and translation models for energy efficient executing without substantial loss in precision.

Details

OriginalspracheEnglisch
Titel2024 Design, Automation and Test in Europe Conference and Exhibition, DATE 2024 - Proceedings
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers (IEEE)
ISBN (elektronisch)9798350348590, 978-3-9819263-8-5
PublikationsstatusVeröffentlicht - 2024
Peer-Review-StatusJa
Extern publiziertJa

Publikationsreihe

ReiheProceedings -Design, Automation and Test in Europe, DATE
ISSN1530-1591

Konferenz

Titel2024 Design, Automation and Test in Europe Conference and Exhibition
KurztitelDATE 2024
Veranstaltungsnummer27
Dauer25 - 27 März 2024
Webseite
OrtPalacio De Congresos De Valencia
StadtValencia
LandSpanien

Externe IDs

ORCID /0000-0003-3814-0378/work/163295411
Scopus 85196503324

Schlagworte

ASJC Scopus Sachgebiete

Schlagwörter

  • ANN, DTCO, emerging technologies, translation