Database Support for Processing Complex Aggregate Queries over Data Streams

Research output: Contribution to conferencesPaperContributedpeer-review

Contributors

  • Yuanzhen Ji - (Author)

Abstract

Over the last few years, the increasing demand on processing streaming data with high throughput and low latency has led to the development of specialized stream processing engines (SPE). Although existing SPEs show high performance in evaluating stateless operations and stateful operations with small windows, their performance degrades significantly when calculating exact answers for complex aggregate queries with huge windows. Examples include correlated aggregations, quantile and ordering statistic computation. Meanwhile, modern database systems have demonstrated the ability of processing complex analytical tasks efficiently over very large datasets, using technologies such as vertical storage, vectorized query execution, etc. This suggests the feasibility of leveraging database systems to assist SPEs to process complex aggregate queries to reduce their evaluation latency.

The goal of this thesis is to investigate the potential of combining database systems with SPEs in the context of stream processing so as to improve the overall query evaluation performance. To this end, the following two major topics will be addressed in this thesis: (1) dynamic migration of complex aggregate operations between the SPE and the database in response to varying system load and (2) efficient evaluation of continuous queries over streaming data that is migrated to the database.

Details

Original languageEnglish
Pages31-37
Number of pages7
Publication statusPublished - 2013
Peer-reviewedYes

Conference

Title Joint EDBT/ICDT 2013 Workshops (EDBT '13), ACM, 2013
Abbreviated titleEDBT'133
Conference number
Duration18 March 2013
Degree of recognitionInternational event
Location
CityGenoa
CountryItaly

External IDs

Scopus 84876798849

Keywords

Research priority areas of TU Dresden