Degraded-First Task Scheduler for MapReduce in Erasure-Coded Storage

Introduction

Erasure coding is increasingly adopted in modern clustered storage systems to reduce the storage overhead of traditional 3-way replication. However, how to customize the data analytics paradigm for erasure-coded storage remains an open issue, especially when the storage system operates in failure mode. We propose degraded-first scheduling, a new MapReduce scheduling scheme that improves MapReduce performance in erasure-coded clustered storage systems in failure mode. Its main idea is to launch degraded tasks (map tasks whose input blocks are unavailable and must be reconstructed from blocks on surviving nodes) earlier, so that they use network resources that would otherwise sit idle. We conduct mathematical analysis and discrete event simulation to show the performance gain of degraded-first scheduling over Hadoop's default locality-first scheduling. We further implement degraded-first scheduling on Hadoop and conduct testbed experiments on a 13-node cluster. The results show that degraded-first scheduling reduces MapReduce runtime compared with locality-first scheduling.
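The intuition can be illustrated with a toy model in Python. This is only a sketch under stated assumptions, not the scheduler shipped in the software package. The function names, the pacing rule (launch a degraded task whenever the launched share of degraded tasks lags the launched share of all tasks), and the cost model (one shared network link that serves degraded-read downloads one at a time) are all assumptions made for illustration.

```python
import heapq

LOCAL, DEGRADED = "local", "degraded"


def locality_first_order(n_local, n_degraded):
    """Toy locality-first order: every local task first, degraded tasks last."""
    return [LOCAL] * n_local + [DEGRADED] * n_degraded


def degraded_first_order(n_local, n_degraded):
    """Toy degraded-first order: spread degraded tasks across the launch sequence.

    Assumed pacing rule: launch a degraded task when
    sent_degraded / n_degraded <= launched / total.
    """
    total = n_local + n_degraded
    order = []
    sent_local = sent_degraded = 0
    for launched in range(total):
        # Cross-multiplied form of the pacing rule to avoid division by zero.
        lagging = sent_degraded * total <= launched * n_degraded
        if sent_degraded < n_degraded and (lagging or sent_local == n_local):
            order.append(DEGRADED)
            sent_degraded += 1
        else:
            order.append(LOCAL)
            sent_local += 1
    return order


def makespan(order, slots, process_time, download_time):
    """Finish time of the last task under the assumed cost model.

    Each free slot takes the next task in `order`. A degraded task must first
    download its reconstruction data over a single shared link that handles
    one download at a time (a crude stand-in for network contention).
    """
    free_at = [0] * slots          # min-heap of times at which slots become free
    heapq.heapify(free_at)
    network_free_at = 0
    finish = 0
    for task in order:
        start = heapq.heappop(free_at)
        if task == DEGRADED:
            # Wait for the link, then download before processing can begin.
            start = max(start, network_free_at) + download_time
            network_free_at = start
        end = start + process_time
        heapq.heappush(free_at, end)
        finish = max(finish, end)
    return finish


if __name__ == "__main__":
    for name, build in [("locality-first", locality_first_order),
                        ("degraded-first", degraded_first_order)]:
        order = build(6, 2)
        print(name, makespan(order, slots=2, process_time=1, download_time=2))
```

With 6 local tasks, 2 degraded tasks, 2 slots, unit processing time, and a download time of 2, the toy model gives a makespan of 8 for the locality-first order and 6 for the degraded-first order. In the locality-first order both downloads queue on the link at the end of the map phase, while in the degraded-first order each download overlaps with local tasks running on the other slot. These numbers come from the toy model only and say nothing about the measured results in the paper.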

Publication

Runhui Li, Patrick P. C. Lee, Yuchong Hu
"Degraded-First Scheduling for MapReduce in Erasure-Coded Storage"
Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2014) (Regular paper), Atlanta, Georgia, June 2014.

Download

A readme file is included in the software package.

ChangeLog

Version 1.0 (April 2014): DegradedFirstTaskScheduler-1.0.0.tar.gz (md5sum: 361b5aec73393d11c1a2375a4b50c683)

People

The software was developed by the Advanced Network and System Research Laboratory in the Department of Computer Science and Engineering at the Chinese University of Hong Kong (CUHK).

Runhui Li (PhD)
Patrick P. C. Lee (Faculty)
Yuchong Hu (Postdoc)

License

The source code of degraded-first scheduling is released under the GNU General Public License (GPL).

Acknowledgments

The work is supported by grants AoE/E-02/08 and ECS CUHK419212 from the University Grants Committee of Hong Kong.