Active Replication at (Almost) No Cost

Publikation: Beitrag zu KonferenzenPaperBeigetragenBegutachtung

Beitragende

Abstract

MapReduce has become a popular programming paradigm in the domain of batch processing systems. Its simplicity allows applications to be highly scalable and to be easily deployed on large clusters. More recently, the MapReduce approach has been also applied to Event Stream Processing (ESP) systems. This approach, which we call StreamMapReduce, enabled many novel applications that require both scalability and low latency. Another recent trend is to move distributed applications to public clouds such as Amazon EC2 rather than running and maintaining private data centers. Most cloud providers charge their customers on an hourly basis rather than on CPU cycles consumed. However, many applications, especially those that process online data, need to limit their CPU utilization to conservative levels (often as low as 50%) to be able to accommodate natural and sudden load variations without causing unacceptable deterioration in responsiveness. In this paper, we present a new fault tolerance approach based on active replication for StreamMapReduce systems. This approach is cost effective for cloud consumers as well as cloud providers. Cost effectiveness is achieved by fully utilizing the acquired computational resources without performance degradation and by reducing the need for additional nodes dedicated to fault tolerance.

Details

OriginalspracheEnglisch
Seiten21-30
Seitenumfang10
PublikationsstatusVeröffentlicht - 2011
Peer-Review-StatusJa

Konferenz

Titel2011 30th IEEE International Symposium on Reliable Distributed Systems
KurztitelSRDS 2011
Veranstaltungsnummer30
Dauer4 - 7 Oktober 2011
BekanntheitsgradInternationale Veranstaltung
StadtMadrid
LandSpanien

Externe IDs

Scopus 83155189012

Schlagworte

Forschungsprofillinien der TU Dresden

DFG-Fachsystematik nach Fachkollegium

Ziele für nachhaltige Entwicklung

Schlagwörter

  • repliication, Fault Tolerance, energy efficiency, mapreduce