Beyond straightforward vectorization of lightweight data compression algorithms for larger vector sizes

Johannes Pietrzyk; Annett Ungethüm; Dirk Habich; Wolfgang Lehner

Beyond straightforward vectorization of lightweight data compression algorithms for larger vector sizes

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung

Beitragende

Johannes Pietrzyk - , Professur für Datenbanken (Autor:in)
Annett Ungethüm - , Institut für Systemarchitektur, Professur für Datenbanken (Autor:in)
Dirk Habich - , Professur für Datenbanken (Autor:in)
Wolfgang Lehner - , Professur für Datenbanken (Autor:in)

Abstract

Data as well as hardware characteristics are two key aspects for efficient data management. This holds in particular for the field of in-memory data processing. Aside from increasing main memory capacities, efficient in-memory processing benefits from novel processing concepts based on lightweight compressed data. Thus, an active research field deals with the adaptation of new hardware features such as vectorization using SIMD instructions to speeduplightweight data compression algorithms. Most of the vectorized. implementations have been proposed for 128-bit vector registers. A straightforward transformation to wider vector sizes is possible. However, this straightforward way does not exploit the capabilities of newer SIMD extensions to the maximum extent as we will show in this paper. On the one hand, we present a novel implementation concept for run-length encoding using conflict-detection operations which have been introduced in Intel's AVX-512 SIMD extension. On the other hand, we investigate different data layouts for vectorization and their impact on wider vector sizes. Copyright is held by the author/owner(s).

Details

Originalsprache	Englisch
Titel	Grundlagen von Datenbanken
Redakteure/-innen	Gerhard Klassen, Stefan Conrad
Seiten	71-76
Seitenumfang	6
Publikationsstatus	Veröffentlicht - 2018
Peer-Review-Status	Ja

Publikationsreihe

Reihe	CEUR Workshop Proceedings
Band	2126
ISSN	1613-0073

Workshop

Titel	30. GI-Workshop Grundlagen von Datenbanken
Kurztitel	GvDB 2018
Veranstaltungsnummer	30
Dauer	22 - 25 Mai 2018
Webseite	https://ak-grundis.gi.de/gvdb-workshops
Stadt	Wuppertal
Land	Deutschland

Externe IDs

Scopus	85049774436
ORCID	/0000-0001-8107-2775/work/142253475

Schlagworte

ASJC Scopus Sachgebiete

Allgemeine Computerwissenschaft

Forschungsportal der TU Dresden