High-Flexibility Designs of Quantized Runtime Reconfigurable Multi-Precision Multipliers

Yuhao Liu; Shubham Rai; Salim Ullah; Akash Kumar

doi:10.1109/LES.2023.3298736

Publication: Contribution to journal › Research article › Contributed › Peer review

Contributors

Yuhao Liu, Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI Dresden), Chair of Processor Design (cfaed) (Author)
Shubham Rai, Center for Advancing Electronics Dresden (cfaed) (Author)
Salim Ullah, Chair of Processor Design (cfaed) (Author)
Akash Kumar, Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI Dresden), Chair of Processor Design (cfaed) (Author)

Abstract

Recent research has widely explored quantization schemes on hardware. However, accelerators that support only 8-bit quantization, such as the Google TPU, require lower-precision inputs, such as the 1-bit and 2-bit quantized neural network models in FINN, to be extended to the full data width to meet the hardware interface requirements. This conversion affects communication and computing efficiency. To improve the flexibility and throughput of quantized multipliers, this letter explores two novel runtime reconfigurable multi-precision multiplier designs, based on the multiplier-tree and bit-serial multiplier architectures, that can repartition the number of input channels at runtime according to the input precision and reconfigure the signed/unsigned multiplication mode. We evaluated our designs by implementing a systolic array and a single-layer neural network accelerator on the Ultra96 FPGA platform. The results show the flexibility of our implementation and a high speedup for low-precision quantized multiplication when working with a fixed hardware interface data width.
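
The idea of repartitioning a fixed-width interface into several low-precision channels can be illustrated with a small software model. The sketch below is not the hardware design from the letter. It is a minimal Python illustration under stated assumptions: an 8-bit interface word (`WIDTH`), channels packed LSB first, two's-complement encoding in signed mode (so the ±1 encoding common in binarized networks is not modeled), and hypothetical helper names (`unpack`, `to_signed`, `bit_serial_multiply`, `multiply_packed`). The per-channel multiplication uses a shift-and-add loop in the spirit of a bit-serial multiplier.

```python
WIDTH = 8  # assumed fixed interface width for this sketch


def unpack(word, precision, width=WIDTH):
    """Split a width-bit word into width // precision raw fields, LSB first."""
    if precision <= 0 or width % precision != 0:
        raise ValueError("precision must evenly divide the interface width")
    mask = (1 << precision) - 1
    return [(word >> (i * precision)) & mask for i in range(width // precision)]


def to_signed(bits, precision):
    """Interpret a raw precision-bit pattern as a two's-complement integer."""
    sign_bit = 1 << (precision - 1)
    return bits - (1 << precision) if bits & sign_bit else bits


def bit_serial_multiply(a, b_bits, precision, signed):
    """Multiply a by the raw pattern b_bits, consuming one bit of b per step.

    In signed mode the most significant bit of b carries negative weight,
    so its partial product is subtracted instead of added.
    """
    acc = 0
    for i in range(precision):
        if (b_bits >> i) & 1:
            partial = a << i
            if signed and i == precision - 1:
                acc -= partial
            else:
                acc += partial
    return acc


def multiply_packed(a_word, b_word, precision, signed):
    """Multiply two packed words channel by channel at the given precision."""
    products = []
    for a_bits, b_bits in zip(unpack(a_word, precision), unpack(b_word, precision)):
        a = to_signed(a_bits, precision) if signed else a_bits
        products.append(bit_serial_multiply(a, b_bits, precision, signed))
    return products


if __name__ == "__main__":
    # Same pair of 8-bit words, reinterpreted at different precisions and modes.
    print(multiply_packed(0b11100100, 0b11111111, 2, signed=False))
    print(multiply_packed(0b11100100, 0b11111111, 2, signed=True))
    print(multiply_packed(0b11010110, 0b00111111, 4, signed=False))
    print(multiply_packed(200, 100, 8, signed=True))
```

In this model, one 8-bit word pair yields four products in 2-bit mode but only one in 8-bit mode, which is the kind of throughput gain a fixed-width interface forgoes when low-precision inputs are extended to 8 bits.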

Details

Original language	English
Pages (from - to)	194-197
Number of pages	4
Journal	IEEE Embedded Systems Letters
Volume	15
Issue number	4
Publication status	Published - 1 Dec 2023
Peer-review status	Yes

Keywords

multi-precision, multiplier, quantization, runtime reconfiguration