Modeling large time series for efficient approximate query processing

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/GutachtenBeitrag in KonferenzbandBeigetragenBegutachtung

Beitragende

Abstract

Evolving customer requirements and increasing competition force business organizations to store increasing amounts of data and query them for information at any given time. Due to the current growth of data volumes, timely extraction of relevant information becomes more and more difficult with traditional methods. In addition, contemporary Decision Support Systems (DSS) favor faster approximations over slower exact results. Generally speaking, processes that require exchange of data become inefficient when connection bandwidth does not increase as fast as the volume of data. In order to tackle these issues, compression techniques have been introduced in many areas of data processing. In this paper, we outline a new system that does not query complete datasets but instead utilizes models to extract the requested information. For time series data we use Fourier and Cosine transformations and piecewise aggregation to derive the models. These models are initially created from the original data and are kept in the database along with it. Subsequent queries are answered using the stored models rather than scanning and processing the original datasets. In order to support model query processing, we maintain query statistics derived from experiments and when running the system. Our approach can also reduce communication load by exchanging models instead of data. To allow seamless integration of model-based querying into traditional data warehouses, we introduce a SQL compatible query terminology. Our experiments show that querying models is up to 80% faster than querying over the raw data while retaining a high accuracy.

Details

OriginalspracheEnglisch
TitelDatabase Systems for Advanced Applications - DASFAA 2015 International Workshops, SeCoP, BDMS, and Posters, Revised Selected Papers
Redakteure/-innenYoshiharu Ishikawa, Sarana Nutanong, An Liu, Tieyun Qian, Muhammad Aamir Cheema
Herausgeber (Verlag)Springer-Verlag
Seiten190-204
Seitenumfang15
ISBN (elektronisch)978-3-319-22324-7
ISBN (Print)978-3-319-22323-0
PublikationsstatusVeröffentlicht - 2015
Peer-Review-StatusJa

Publikationsreihe

ReiheLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band9052
ISSN0302-9743

Konferenz

Titel2nd International Workshop on Semantic Computing and Personalization, SeCoP 2015, 2nd International Workshop on Big Data Management and Service, BDMS 2015 held in conjunction with 20th International Conference on Database Systems for Advanced Applications, DASFAA 2015
Dauer20 - 23 April 2015
StadtHanoi
LandVietnam

Externe IDs

ORCID /0000-0001-8107-2775/work/198592321

Schlagworte

Forschungsprofillinien der TU Dresden

Fächergruppen, Lehr- und Forschungsbereiche, Fachgebiete nach Destatis