Topology-aware optimization of big sparse matrices and matrix multiplications on main-memory systems

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/GutachtenBeitrag in KonferenzbandBeigetragenBegutachtung

Beitragende

  • David Kernert - , Technische Universität Dresden (Autor:in)
  • Wolfgang Lehner - , Technische Universität Dresden (Autor:in)
  • Frank Kohler - , SAP Research (Autor:in)

Abstract

Since data sizes of analytical applications are continuously growing, many data scientists are switching from customized micro-solutions to scalable alternatives, such as statistical and scientific databases. However, many algorithms in data mining and science are expressed in terms of linear algebra, which is barely supported by major database vendors and big data solutions. On the other side, conventional linear algebra algorithms and legacy matrix representations are often not suitable for very large matrices. We propose a strategy for large matrix processing on modern multicore systems that is based on a novel, adaptive tile matrix representation (AT MATRIX). Our solution utilizes multiple techniques inspired from database technology, such as multidimensional data partitioning, cardinality estimation, indexing, dynamic rewrites, and many more in order to optimize the execution time. Based thereon we present a matrix multiplication operator ATMULT, which outperforms alternative approaches. The aim of our solution is to overcome the burden for data scientists of selecting appropriate algorithms and matrix storage representations. We evaluated AT MATRIX together with ATMULT on several real-world and synthetic random matrices.

Details

OriginalspracheEnglisch
Titel2016 IEEE 32nd International Conference on Data Engineering, ICDE 2016
Herausgeber (Verlag)IEEE, New York [u. a.]
Seiten823-834
Seitenumfang12
ISBN (elektronisch)9781509020195
PublikationsstatusVeröffentlicht - 22 Juni 2016
Peer-Review-StatusJa
Extern publiziertJa

Publikationsreihe

Reihe International Conference on Data Engineering (ICDE)
ISSN1063-6382

Konferenz

Titel32nd IEEE International Conference on Data Engineering, ICDE 2016
Dauer16 - 20 Mai 2016
StadtHelsinki
LandFinnland

Externe IDs

ORCID /0000-0001-8107-2775/work/142253545