A Virtualization-Based Hybrid Storage System for a Map-Reduce Framework

Aseffa DEREJE TEKILU, Chin-Hsien WU

  • Full Text Views

    0

  • Cite this

Summary :

A map-reduce framework is popular for big data analysis. In the typical map-reduce framework, both master node and worker nodes can use hard-disk drives (HDDs) as local disks for the map-reduce computation. However, because of the inherit mechanical problems of HDDs, the I/O performance is a bottleneck for the map-reduce framework when I/O-intensive applications (e.g., sorting) are performed. Replacing HDDs with solid-state drives (SSDs) is not economical, although SSDs have better performance than HDDs. In this paper, we propose a virtualization-based hybrid storage system for the map-reduce framework. The objective of the paper is to combine the advantages of the fast access property of SSDs and the low cost of HDDs by realizing an economical design and improving I/O performance of a map-reduce framework in a virtualization environment. We propose three storage combinations: SSD-based, HDD-based, and a hybrid of SSD-based and HDD-based storage systems which balances speed, capacity, and lifetime. According to experiments, the hybrid of SSD-based and HDD-based storage systems offers superior performance and economy.

Publication
IEICE TRANSACTIONS on Information Vol.E99-D No.9 pp.2248-2258
Publication Date
2016/09/01
Publicized
2016/05/25
Online ISSN
1745-1361
DOI
10.1587/transinf.2015EDP7365
Type of Manuscript
PAPER
Category
Software System

Authors

Aseffa DEREJE TEKILU
  National Taiwan University of Science and Technology
Chin-Hsien WU
  National Taiwan University of Science and Technology

Keyword

FlyerIEICE has prepared a flyer regarding multilingual services. Please use the one in your native language.