Direct transformation techniques for compressed data: General approach and application scenarios

Research output: Contribution to book/Conference proceedings/Anthology/Report > Conference contribution > Contributed > peer-review

Contributors

Abstract

Lightweight data compression techniques like dictionary or run-length compression play an important role in main memory database systems. Once a compression scheme has been chosen for a dataset, transforming the data to another scheme is very inefficient today. The common approach works as follows: First, the compressed data is decompressed using the source decompression algorithm, which materializes the raw data in main memory. Second, the compression algorithm of the destination scheme is applied. This indirect way relies on existing algorithms, but it is very inefficient, since the whole uncompressed data has to be materialized as an intermediate step. To overcome this drawback, we propose a novel approach called direct transformation, which avoids the materialization of the whole uncompressed data. Our techniques are cache-optimized to reduce the number of necessary data accesses. Moreover, we present application scenarios in which such direct transformations can be applied efficiently.
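
The contrast between the indirect and the direct way can be illustrated with a small sketch. It is not the algorithm from the paper, only a minimal example under stated assumptions: the source scheme is run-length encoding stored as (value, run length) pairs, the destination scheme is a sorted dictionary plus a list of integer codes, and all function names are hypothetical. The cache optimizations mentioned in the abstract are not modeled.

```python
from typing import List, Tuple

Run = Tuple[int, int]  # (value, run_length); an assumed RLE layout


def rle_decompress(runs: List[Run]) -> List[int]:
    """Expand run-length encoded data into the raw value sequence."""
    raw: List[int] = []
    for value, length in runs:
        raw.extend([value] * length)
    return raw


def dict_compress(values: List[int]) -> Tuple[List[int], List[int]]:
    """Dictionary-compress raw values into (sorted dictionary, codes)."""
    dictionary = sorted(set(values))
    code_of = {v: i for i, v in enumerate(dictionary)}
    return dictionary, [code_of[v] for v in values]


def indirect_rle_to_dict(runs: List[Run]) -> Tuple[List[int], List[int]]:
    """Indirect way: materialize all raw data, then recompress it."""
    raw = rle_decompress(runs)  # full uncompressed intermediate
    return dict_compress(raw)


def direct_rle_to_dict(runs: List[Run]) -> Tuple[List[int], List[int]]:
    """Direct way (illustrative): work on the runs, never build the raw data."""
    # The dictionary only needs the distinct run values.
    dictionary = sorted({value for value, _ in runs})
    code_of = {v: i for i, v in enumerate(dictionary)}
    codes: List[int] = []
    for value, length in runs:
        # One dictionary lookup per run instead of one per element.
        codes.extend([code_of[value]] * length)
    return dictionary, codes


runs = [(7, 3), (2, 2), (7, 1)]
assert direct_rle_to_dict(runs) == indirect_rle_to_dict(runs)
```

Both functions produce the same dictionary-compressed output; the direct variant skips the intermediate raw sequence and touches the dictionary once per run rather than once per value.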

Details

Original language: English
Title of host publication: Advances in Databases and Information Systems
Editors: Ladjel Bellatreche, Tadeusz Morzy, Patrick Valduriez
Publisher: Springer-Verlag
Pages: 151-165
Number of pages: 15
ISBN (electronic): 978-3-319-23135-8
ISBN (print): 978-3-319-23134-1
Publication status: Published - 2015
Peer-reviewed: Yes

Publication series

Series: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 9282
ISSN: 0302-9743

Conference

Title: 19th East European Conference on Advances in Databases and Information Systems, ADBIS 2015
Duration: 8 - 11 September 2015
City: Poitiers
Country: France

External IDs

ORCID /0000-0001-8107-2775/work/199215564

Keywords

  • Efficient algorithms, Lightweight data compression, Main memory database systems