Database Support for Processing Complex Aggregate Queries over Data Streams

Publikation: Beitrag zu KonferenzenPaperBeigetragenBegutachtung

Beitragende

  • Yuanzhen Ji - (Autor:in)

Abstract

Over the last few years, the increasing demand on processing streaming data with high throughput and low latency has led to the development of specialized stream processing engines (SPE). Although existing SPEs show high performance in evaluating stateless operations and stateful operations with small windows, their performance degrades significantly when calculating exact answers for complex aggregate queries with huge windows. Examples include correlated aggregations, quantile and ordering statistic computation. Meanwhile, modern database systems have demonstrated the ability of processing complex analytical tasks efficiently over very large datasets, using technologies such as vertical storage, vectorized query execution, etc. This suggests the feasibility of leveraging database systems to assist SPEs to process complex aggregate queries to reduce their evaluation latency.

The goal of this thesis is to investigate the potential of combining database systems with SPEs in the context of stream processing so as to improve the overall query evaluation performance. To this end, the following two major topics will be addressed in this thesis: (1) dynamic migration of complex aggregate operations between the SPE and the database in response to varying system load and (2) efficient evaluation of continuous queries over streaming data that is migrated to the database.

Details

OriginalspracheEnglisch
Seiten31-37
Seitenumfang7
PublikationsstatusVeröffentlicht - 2013
Peer-Review-StatusJa

Konferenz

Titel Joint EDBT/ICDT 2013 Workshops (EDBT '13), ACM, 2013
KurztitelEDBT'133
Veranstaltungsnummer
Dauer18 März 2013
BekanntheitsgradInternationale Veranstaltung
Ort
StadtGenoa
LandItalien

Externe IDs

Scopus 84876798849

Schlagworte

Forschungsprofillinien der TU Dresden