Needles in the haystack — tackling bit flips in lightweight compressed data
Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung
Beitragende
Abstract
Modern database systems are very often in the position to store their entire data in main memory. Aside from increased main emory capacities, a further driver for in-memory database system has been the shift to a column-oriented storage format in combination with lightweight data compression techniques. Using both mentioned software concepts, large datasets can be held and efficiently processed in main memory with a low memory footprint. Unfortunately, hardware becomes more and more vulnerable to random faults, so that e.g., the probability rate for bit flips in main memory increases, and this rate is likely to escalate in future dynamic random-access memory (DRAM) modules. Since the data is highly compressed by the lightweight compression algorithms, multi bit flips will have an extreme impact on the reliability of database systems. To tackle this reliability issue, we introduce our research on error resilient lightweight data compression algorithms in this paper. Of course, our software approach lacks the efficiency of hardware realization, but its flexibility and adaptability will play a more important role regarding differing error rates, e.g. due to hardware aging effects and aggressive processor voltage and frequency scaling. Arithmetic AN encoding is one family of codes which is an interesting candidate for effective software-based error detection. We present results of our research showing tradeoffs between compressibility and resiliency characteristics of data. We show that particular choices of the AN-code parameter lead to a moderate loss of performance. We provide evaluation for two proposed techniques, namely AN-encoded Null Suppression and AN-encoded Run Length Encoding.
Details
| Originalsprache | Englisch |
|---|---|
| Titel | Data Management Technologies and Applications |
| Redakteure/-innen | Orlando Belo, Andreas Holzinger, Markus Helfert, Chiara Francalanci |
| Herausgeber (Verlag) | Springer-Verlag |
| Seiten | 135-153 |
| Seitenumfang | 19 |
| ISBN (elektronisch) | 978-3-319-30162-4 |
| ISBN (Print) | 978-3-319-30161-7 |
| Publikationsstatus | Veröffentlicht - 2016 |
| Peer-Review-Status | Ja |
Publikationsreihe
| Reihe | Communications in Computer and Information Science |
|---|---|
| Band | 584 |
| ISSN | 1865-0929 |
Konferenz
| Titel | 4th International Conference on Data Management Technologies and Applications, DATA 2015 |
|---|---|
| Dauer | 20 - 22 Juli 2015 |
| Stadt | Colmar |
| Land | Frankreich |
Externe IDs
| ORCID | /0000-0001-8107-2775/work/198592307 |
|---|