Efficient approximate OLAP querying over time series

Research output: Contribution to book/conference proceedings/anthology/reportConference contributionContributedpeer-review

Contributors

  • Kasun S. Perera - , TUD Dresden University of Technology (Author)
  • Martin Hahmann - , TUD Dresden University of Technology (Author)
  • Wolfgang Lehner - , TUD Dresden University of Technology (Author)
  • Torben Bach Pedersen - , Aalborg University (Author)
  • Christian Thomsen - , Aalborg University (Author)

Abstract

The ongoing trend for data gathering not only produces larger volumes of data, but also increases the variety of recorded data types. Out of these, especially time series, e.g. various sensor readings, have attracted attention in the domains of business intelligence and decision making. As OLAP queries play a major role in these domains, it is desirable to also execute them on time series data. While this is not a problem on the conceptual level, it can become a bottleneck with regards to query run-time. In general, processing OLAP queries gets more computationally intensive as the volume of data grows. This is a particular problem when querying time series data, which generally contains multiple measures recorded at fine time granularities. Usually, this issue is addressed either by scaling up hardware or by employing workload based query optimization techniques. However, these solutions are either costly or require continuous maintenance. In this paper we propose an approach for approximate OLAP querying of time series that offers constant latency and is maintenance-free. To achieve this, we identify similarities between aggregation cuboids and propose algorithms that eliminate the redundancy these similarities present. In doing so, we can achieve compression rates of up to 80% while maintaining low average errors in the query results.

Details

Original languageEnglish
Title of host publicationProceedings of the 20th International Database Engineering and Applications Symposium, IDEAS 2016
EditorsBipin C. Desai, Evan Desai
PublisherAssociation for Computing Machinery (ACM), New York
Pages205-211
Number of pages7
ISBN (electronic)9781450341189
Publication statusPublished - 11 Jul 2016
Peer-reviewedYes
Externally publishedYes

Conference

Title20th International Database Engineering and Applications Symposium, IDEAS 2016
Duration11 - 13 July 2016
CityMontreal
CountryCanada

External IDs

ORCID /0000-0001-8107-2775/work/142253536

Keywords

Keywords

  • Approximate query processing, Data modeling, Time series