Conflict Detection-Based Run-Length Encoding - AVX-512 CD Instruction Set in Action.

Research output: Contribution to book/conference proceedings/anthology/report > Conference contribution > Contributed > peer-review

Contributors

Abstract

Data characteristics and hardware characteristics are two key aspects of efficient data management. This holds in particular for the field of in-memory data processing. Aside from increasing main memory capacities, efficient in-memory processing benefits from novel processing concepts based on lightweight compressed data. Thus, an active research field deals with the adaptation of new hardware features, such as vectorization using SIMD instructions, to speed up lightweight data compression algorithms. Following this trend, we propose a novel approach for run-length encoding (RLE), a well-known and often applied lightweight compression technique. Our approach is based on the newly introduced conflict detection (CD) instructions in Intel's AVX-512 instruction set extension. As we are going to show, our CD-based approach has unique properties and outperforms the state-of-the-art RLE approach for data sets with small run lengths.
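
For readers unfamiliar with run-length encoding, the following is a minimal scalar sketch of the basic technique only. It is not the CD-based vectorized method proposed in the paper, and the function names rle_encode and rle_decode are hypothetical, chosen for illustration. Each run of equal consecutive values is stored as a (value, run length) pair.

```python
from typing import Iterable, List, Tuple


def rle_encode(values: Iterable[int]) -> List[Tuple[int, int]]:
    """Scalar RLE (illustrative only): collapse each run into (value, run_length)."""
    runs: List[Tuple[int, int]] = []
    for v in values:
        if runs and runs[-1][0] == v:
            # Same value as the current run: extend the run by one.
            runs[-1] = (v, runs[-1][1] + 1)
        else:
            # Value changed (or first element): start a new run.
            runs.append((v, 1))
    return runs


def rle_decode(runs: Iterable[Tuple[int, int]]) -> List[int]:
    """Inverse of rle_encode: expand each (value, run_length) pair."""
    out: List[int] = []
    for value, length in runs:
        out.extend([value] * length)
    return out
```

With short runs, the pair representation saves little or nothing over the raw data, which is the regime the abstract singles out for the comparison with the state-of-the-art approach.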

Details

Original language: English
Title of host publication: Proceedings - IEEE 34th International Conference on Data Engineering Workshops, ICDEW 2018
Publisher: IEEE, New York [u. a.]
Pages: 96-101
Number of pages: 6
ISBN (electronic): 9781538663066
Publication status: Published - 2 Jul 2018
Peer-reviewed: Yes

Conference

Title: 34th IEEE International Conference on Data Engineering Workshops, ICDEW 2018
Duration: 16 - 19 April 2018
City: Paris
Country: France

External IDs

Scopus 85049809911
ORCID /0000-0001-8107-2775/work/142253476

Keywords

  • AVX 512, Lightweight data compression, Run length encoding, Vectorization