A benchmark framework for data compression techniques
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
Lightweight data compression is frequently applied in main memory database systems to improve query performance. The data processed by such systems is highly diverse. Moreover, there is a high number of existing lightweight compression techniques. Therefore, choosing the optimal technique for a given dataset is non-trivial. Existing approaches are based on simple rules, which do not suffice for such a complex decision. In contrast, our vision is a cost-based approach. However, this requires a detailed cost model, which can only be obtained from a systematic benchmarking of many compression algorithms on many different datasets. A naïve benchmark evaluates every algorithm under consideration separately. This yields many redundant steps and is thus inefficient. We propose an efficient and extensible benchmark framework for compression techniques. Given an ensemble of algorithms, it minimizes the overall run time of the evaluation. We experimentally show that our approach outperforms the naïve approach.
Details
| Original language | English |
|---|---|
| Title of host publication | Performance Evaluation and Benchmarking |
| Editors | Raghunath Nambiar, Meikel Poess |
| Publisher | Springer-Verlag |
| Pages | 77-93 |
| Number of pages | 17 |
| ISBN (electronic) | 978-3-319-31409-9 |
| ISBN (print) | 978-3-319-31408-2 |
| Publication status | Published - 2016 |
| Peer-reviewed | Yes |
Publication series
| Series | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 9508 |
| ISSN | 0302-9743 |
Conference
| Title | 7th TPC Technology Conference on Performance Evaluation and Benchmarking, TPCTC 2015 held in conjunction with 40th International Conference on Very Large Data Bases, VLDB 2015 |
|---|---|
| Duration | 31 August - 4 September 2015 |
| City | Kohala Coast |
| Country | United States of America |
External IDs
| ORCID | /0000-0001-8107-2775/work/198592311 |
|---|
Keywords
Research priority areas of TU Dresden
DFG Classification of Subject Areas according to Review Boards
Subject groups, research areas, subject areas according to Destatis
ASJC Scopus subject areas
Keywords
- Efficient benchmarking, Lightweight data compression, Main memory database systems