文章摘要
Zhang Xiaohong (张霄宏),Feng Shengzhong,Fan Jianping,Huang Zhexue.[J].高技术通讯(英文),2012,18(4):343~349
Method for improving MapReduce performance by prefetching before scheduling
  
DOI:
中文关键词: 
英文关键词: cloud computing, distributed computing, prefetching, MapReduce, scheduling
基金项目:
Author NameAffiliation
Zhang Xiaohong (张霄宏)  
Feng Shengzhong  
Fan Jianping  
Huang Zhexue  
Hits: 628
Download times: 0
中文摘要:
      
英文摘要:
      In this paper, a prefetching technique is proposed to solve the performance problem caused by remote data access delay. In the technique, the map tasks which will cause the delay are predicted first and then the input data of these tasks will be preloaded before the tasks are scheduled. During the execution, the input data can be read from local nodes. Therefore, the delay can be hidden. The technique has been implemented in Hadoop-0.20.1. The experiment results have shown that the technique reduces map tasks causing delay, and improves the performance of Hadoop MapReduce by 20%.
View Full Text   View/Add Comment  Download reader
Close

分享按钮