Title

Power and Energy Characteristics of MapReduce Data Movements

Document Type

Conference Proceeding

Language

eng

Format of Original

7 p.

Publication Date

6-2013

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Source Publication

2013 International Green Computing Conference (IGCC)

Source ISSN

978-1-4799-0623-9

Original Item ID

doi: 10.1109/IGCC.2013.6604489

Abstract

Worldwide data centers consume about 300 billion kWh of energy per year, which accounts for 2% of total electricity use. As MapReduce becomes the mainstream paradigm for data-intensive computing in data centers, optimizing MapReduce energy efficiency can greatly mitigate energy requirements and reduce energy bills. Numerous studies have attempted to improve MapReduce energy efficiency, but few have approached this problem from understanding and reducing the energy impact of data movements. As data movements are often performance and energy bottlenecks, we propose a data movement centric approach and present an analysis framework with methods and metrics for evaluating costly built-in MapReduce data movements. Our experimental investigation leverages the fine-grained performance and power profiling framework eTune and reveals unique system-level and component-level energy characteristics of data movements. It also shows the scalability of energy efficiency with MapReduce workload and system parameters. These energy characteristics can be exploited in system design and resource allocation to improve data-intensive computing energy efficiency.

Comments

Published as part of the proceedings of the conference, 2013 International Green Computing Conference (IGCC), 2013: 1-7. DOI.