A benchmark framework for data compression techniques

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

Abstract

Lightweight data compression is frequently applied in main memory database systems to improve query performance. The data processed by such systems is highly diverse. Moreover, there is a high number of existing lightweight compression techniques. Therefore, choosing the optimal technique for a given dataset is non-trivial. Existing approaches are based on simple rules, which do not suffice for such a complex decision. In contrast, our vision is a cost-based approach. However, this requires a detailed cost model, which can only be obtained from a systematic benchmarking of many compression algorithms on many different datasets. A naïve benchmark evaluates every algorithm under consideration separately. This yields many redundant steps and is thus inefficient. We propose an efficient and extensible benchmark framework for compression techniques. Given an ensemble of algorithms, it minimizes the overall run time of the evaluation. We experimentally show that our approach outperforms the naïve approach.

Details

Original languageEnglish
Title of host publicationPerformance Evaluation and Benchmarking
EditorsRaghunath Nambiar, Meikel Poess
PublisherSpringer-Verlag
Pages77-93
Number of pages17
ISBN (electronic)978-3-319-31409-9
ISBN (print)978-3-319-31408-2
Publication statusPublished - 2016
Peer-reviewedYes

Publication series

SeriesLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9508
ISSN0302-9743

Conference

Title7th TPC Technology Conference on Performance Evaluation and Benchmarking, TPCTC 2015 held in conjunction with 40th International Conference on Very Large Data Bases, VLDB 2015
Duration31 August - 4 September 2015
CityKohala Coast
CountryUnited States of America

External IDs

ORCID /0000-0001-8107-2775/work/198592311

Keywords

Research priority areas of TU Dresden

DFG Classification of Subject Areas according to Review Boards

Subject groups, research areas, subject areas according to Destatis

Keywords

  • Efficient benchmarking, Lightweight data compression, Main memory database systems