A Flexible and Energy-Efficient Compute-in-Memory Accelerator for Kolmogorov–Arnold Networks

Publikation: Beitrag in FachzeitschriftForschungsartikelBeigetragenBegutachtung

Beitragende

  • Chirag Sudarshan - , Forschungszentrum Jülich (Autor:in)
  • Oliver Artner - , Rheinisch-Westfälische Technische Hochschule Aachen, Forschungszentrum Jülich (Autor:in)
  • Paul Philipp Manea - , Forschungszentrum Jülich, Rheinisch-Westfälische Technische Hochschule Aachen (Autor:in)
  • Sebastian Siegel - , Forschungszentrum Jülich (Autor:in)
  • Susanne Hoffmann-Eifert - , Forschungszentrum Jülich (Autor:in)
  • Regina Dittmann - , Rheinisch-Westfälische Technische Hochschule Aachen, Forschungszentrum Jülich (Autor:in)
  • John Paul Strachan - , Forschungszentrum Jülich, Rheinisch-Westfälische Technische Hochschule Aachen (Autor:in)
  • Transregio 404: Next Generation Electronics With Active Devices in Three Dimensions

Abstract

Emerging Kolmogorov–Arnold networks (KANs) replace the linear weights of neural networks with trainable nonlinear functions. This modification is particularly attractive for scientific computing, where KANs can match the accuracy of conventional multilayer perceptrons (MLPs) while reducing model size by up to 100×. However, this efficiency comes at the cost of computationally expensive nonlinear evaluations, unlike conventional MLPs dominated by linear matrix multiplications. We present a flexible and energy-efficient compute-in-memory accelerator tailored for KANs, developed through cross-layer optimization across algorithm, architecture, circuit, and device levels. The accelerator computes arbitrary nonlinear functions using a single-read scheme and read-optimized memory arrays with nonvolatile memristive devices. Our system achieves a lowest energy of 8.69 pJ per KAN function. In terms of energy-delay product, it provides 1996× improvement over CPUs, 208× over standard MLP-oriented compute-in-memory accelerators, and up to 71× over prior KAN accelerators. These results establish energy-efficient hardware primitives for implementing advanced nonlinear networks in scientific computing.

Details

OriginalspracheEnglisch
Aufsatznummere202501220
FachzeitschriftAdvanced Intelligent Systems
Jahrgang8
Ausgabenummer5
Frühes Online-DatumMärz 2026
PublikationsstatusVeröffentlicht - Mai 2026
Peer-Review-StatusJa
Extern publiziertJa

Schlagworte