
Cloud Computing

MapReduce in Heterogeneous
Environments

Eva Kalyvianaki
[email protected]
Contents

 Looking at MapReduce performance in heterogeneous clusters
 Material is from the paper:
“Improving MapReduce Performance in Heterogeneous Environments”,
by Matei Zaharia, Andy Konwinski, Anthony D. Joseph, Randy Katz and
Ion Stoica, published in the USENIX OSDI conference, 2008
 and their presentation at OSDI

2
Motivation: MapReduce is becoming popular
 Open-source implementation, Hadoop, used by Yahoo!,
Facebook, Last.fm, …
 Scale: 20 PB/day at Google, O(10,000) nodes at Yahoo, 3000
jobs/day at Facebook

3
Stragglers in MapReduce
 A straggler is a node that performs poorly or not at all.
 The original MapReduce mitigation approach was:
 To run a speculative copy (called a backup task)
 Whichever of the original or the copy finished first would be used

 Without speculative execution, a job would be as slow as its
slowest sub-task
 Google notes that speculative execution can improve job
response times by 44%

 Is this approach good enough for modern clusters?

4
Modern Clusters: Heterogeneity is the norm
 Cloud computing providers like Amazon’s Elastic Compute
Cloud (EC2) provide cheap on-demand computing:
 Price: 2 cents / VM / hour
 Scale: thousands of VMs
 Caveat: less control of performance

 The main challenge for Hadoop on EC2 is performance
heterogeneity, which breaks the task scheduler’s assumptions
 This lecture/paper is on a new LATE scheduler that can cut
response time in half

5
MapReduce Revisited

6
MapReduce Implementation, Hadoop

7
Scheduling in MapReduce
 When a node has an empty slot, Hadoop chooses a task from
three categories, in the following priority order:
1. Failed tasks are given the highest priority
2. Unscheduled tasks. For maps, tasks with data local to the node are
chosen first.
3. Speculative tasks are considered last.
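
A minimal sketch of this priority order in Python (the Task fields and helper names are illustrative, not Hadoop's actual scheduler code):

from dataclasses import dataclass, field

@dataclass
class Task:
    name: str
    is_map: bool = True
    input_locations: set = field(default_factory=set)

def choose_task(node, failed, unscheduled, speculative_candidates):
    # 1. Failed tasks get the highest priority.
    if failed:
        return failed[0]
    # 2. Unscheduled tasks next; for maps, prefer tasks whose input is local to the node.
    if unscheduled:
        local = [t for t in unscheduled if t.is_map and node in t.input_locations]
        return (local or unscheduled)[0]
    # 3. Otherwise look for a speculative (backup) task to run.
    return speculative_candidates[0] if speculative_candidates else None

# Example: a free slot on "node3" picks the map task whose input data lives there.
tasks = [Task("m1", input_locations={"node1"}), Task("m2", input_locations={"node3"})]
print(choose_task("node3", failed=[], unscheduled=tasks, speculative_candidates=[]).name)  # m2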

8
Deciding on Speculative Tasks
 Which task to execute speculatively?
 Hadoop monitors task progress using a progress score: a
number between 0 and 1
 For mappers: the score is the fraction of input data read
 For reducers: the execution is divided into three equal phases,
each worth 1/3 of the score:
 Copy phase: percent of map outputs that have been copied
 Sort phase: map outputs are sorted by key: percent of data merged
 Reduce phase: percent of data passed through the reduce function
 Example: a task halfway through the copy phase has
progress score = 1/2*1/3 = 1/6.
 Example: a task halfway through the reduce phase has
progress score = 1/3 + 1/3 + 1/2 * 1/3 = 5/6
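
A small sketch of this scoring, matching the two examples above (function names are illustrative, not Hadoop's API):

def map_progress(bytes_read, input_bytes):
    # For mappers, the score is the fraction of input data read.
    return bytes_read / input_bytes

def reduce_progress(phase, fraction_of_phase_done):
    # For reducers, each phase (copy, sort, reduce) contributes 1/3 of the score.
    phases_completed = ["copy", "sort", "reduce"].index(phase)
    return phases_completed / 3 + fraction_of_phase_done / 3

print(reduce_progress("copy", 0.5))    # halfway through copy   -> 1/6 ~ 0.17
print(reduce_progress("reduce", 0.5))  # halfway through reduce -> 5/6 ~ 0.83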
9
Deciding on Speculative Tasks (con’t)

 Hadoop looks at the average progress of each category
(maps and reduces) and defines a threshold:
 When a task’s progress is less than the average for its
category minus 0.2, and the task has run at least one
minute, it is marked as a straggler:

     threshold = avgProgress – 0.2

 All tasks with progress score < threshold are stragglers


 Ties are broken by data locality
 This approach works reasonably well in homogeneous clusters
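
A minimal sketch of this rule, applied per category (the 0.2 margin and the one-minute floor come from the slide; the function shape is an assumption):

def find_stragglers(tasks, now, margin=0.2, min_runtime=60):
    """tasks: list of (progress_score, start_time_in_seconds) pairs."""
    avg_progress = sum(score for score, _ in tasks) / len(tasks)
    threshold = avg_progress - margin  # threshold = avgProgress - 0.2
    return [
        (score, start) for score, start in tasks
        if score < threshold and (now - start) >= min_runtime  # ran at least one minute
    ]

# Example: with average progress 0.6, a task at 0.3 that has run 2 minutes is a straggler.
print(find_stragglers([(0.9, 0), (0.6, 0), (0.3, 0)], now=120))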

10
Scheduler’s Assumptions
1. Nodes can perform work at roughly the same rate
2. Tasks progress at a constant rate throughout their execution
3. There is no cost to starting a speculative task
4. A task’s progress score is roughly equal to the fraction of its total
work
5. Tasks tend to finish in waves, so a task with a low progress
score is likely a slow task
6. Different tasks of the same category (maps or reduces) require
roughly the same amount of work

11
Revising Scheduler’s Assumptions
1. Nodes can perform work at roughly the same rate
2. Tasks progress at a constant rate throughout their execution

 (1) In heterogeneous clusters, some nodes are slower (older)
than others
 (2) Virtualized clusters “suffer” from co-location interference

12
Heterogeneity in Virtualized Environments

 VM technology isolates CPU and memory, but disk and
network are shared
 Full bandwidth when no contention
 Equal shares when there is contention
 2.5x performance difference
[Figure: IO performance per VM (MB/s) versus the number of VMs on a physical host, from 1 to 7 VMs]
13
Revising Scheduler’s Assumptions
3. There is no cost to starting a speculative task
4. A task’s progress is roughly equal to the fraction of its total
work
5. Tasks tend to finish in waves, so a task with a low progress
score is likely a slow task

 (3) Too many speculative tasks can take resources away
from other running tasks
 (4) The copy phase of reducers is the slowest part, because
it involves all-pairs communication, yet this phase counts
for only 1/3 of the total reduce work.
 (5) Tasks from different generations are executed
concurrently, so newer, faster tasks are compared against older,
slower tasks and avgProgress changes a lot.
14
Idea: Progress Rates

 Instead of using progress score values, compute progress
rates, and back up tasks that are “far enough” below the
mean

 Problem: can still select the wrong tasks

15
Progress Rate Example

[Figure: task timeline over 2 minutes; Node 1 runs at 1 task/min, Node 2 is 3x slower, Node 3 is 1.9x slower]

16
Progress Rate Example

What if the job had 5 tasks?

[Figure: task timeline at the 2-minute mark; Node 2’s current task has 1 min left, Node 3’s current task has 1.8 min left]

Node 2 is the slowest, but it is Node 3’s task that should be backed up!
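
A small worked version of this example (the elapsed time for Node 3's second task is an assumption consistent with the figure; the rest follows the numbers on the slides):

# Snapshot at t = 2 min: Node 2 (3x slower) is 66% through its task;
# Node 3 (1.9x slower) finished a task at ~1.9 min and is ~5.3% into its next one,
# so its current task has run for ~0.1 min (an assumption, not stated on the slide).
tasks = {
    "Node 2": {"progress": 0.66, "elapsed": 2.0},
    "Node 3": {"progress": 0.053, "elapsed": 0.1},
}

for node, t in tasks.items():
    rate = t["progress"] / t["elapsed"]        # progress rate
    time_left = (1 - t["progress"]) / rate     # estimated time to finish
    print(f"{node}: rate = {rate:.2f}/min, time left = {time_left:.1f} min")

# Node 2 has the lower progress rate (0.33 vs 0.53), so a rate-based rule would back it up,
# yet Node 3's task finishes later (1.8 min vs ~1.0 min) and is the one worth backing up.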
17
Our Scheduler: LATE

 Insight: back up the task with the largest estimated finish
time
 “Longest Approximate Time to End” → LATE
 Look forward instead of looking backward

 Sanity thresholds:
 Cap number of backup tasks
 Launch backups on fast nodes
 Only back up tasks that are sufficiently slow

18
LATE Details

 Estimating finish times:

     progress rate = progress score / execution time

     estimated time left = (1 – progress score) / progress rate
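
These two formulas written directly as code (a sketch; the Node 3 elapsed time below is the assumed ~0.1 min from the earlier example):

def progress_rate(progress_score, execution_time):
    # Observed rate of progress: score accumulated per unit of execution time.
    return progress_score / execution_time

def estimated_time_left(progress_score, execution_time):
    # Remaining score divided by the observed rate.
    return (1 - progress_score) / progress_rate(progress_score, execution_time)

# The numbers from the LATE example two slides below:
print(estimated_time_left(0.66, 2.0))    # Node 2: ~1.0 min left
print(estimated_time_left(0.053, 0.1))   # Node 3: ~1.8 min left (assumes ~0.1 min elapsed)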

19
LATE Scheduler

 If a task slot becomes available and there are fewer than
SpeculativeCap speculative tasks running, then:
1. Ignore the request if the node’s total progress is below
SlowNodeThreshold (=25th percentile)
2. Rank currently running, non-speculatively executed tasks by
estimated time left
3. Launch a copy of the highest-ranked task with progress rate below
SlowTaskThreshold (=25th percentile)

 Threshold values:
 10% cap on backups, 25th percentiles for slow node/task
 Validated by sensitivity analysis
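
A minimal sketch of this procedure, assuming per-task progress and elapsed times are tracked (the parameter names mirror the slide; the data structures are illustrative, not the actual Hadoop patch):

def percentile(values, pct):
    # Nearest-rank percentile; good enough for a sketch, avoids external libraries.
    ordered = sorted(values)
    return ordered[max(0, int(len(ordered) * pct / 100) - 1)]

def late_choose_backup(node, node_progress, running_tasks, num_backups, speculative_cap):
    """Return the running task to back up on `node`, or None.
    node_progress: {node name: total progress of that node}
    running_tasks: dicts with 'progress', 'elapsed' and 'speculative' keys."""
    if num_backups >= speculative_cap:
        return None
    # 1. Ignore the request if the node's total progress is below SlowNodeThreshold.
    if node_progress[node] < percentile(node_progress.values(), 25):
        return None
    # 2. Rank currently running, non-speculatively executed tasks by estimated time left.
    candidates = [t for t in running_tasks if not t["speculative"]]
    if not candidates:
        return None
    for t in candidates:
        t["rate"] = t["progress"] / t["elapsed"]
        t["time_left"] = (1 - t["progress"]) / t["rate"]
    candidates.sort(key=lambda t: t["time_left"], reverse=True)
    # 3. Launch a copy of the highest-ranked task with progress rate below SlowTaskThreshold.
    slow_task_threshold = percentile([t["rate"] for t in candidates], 25)
    for t in candidates:
        if t["rate"] < slow_task_threshold:
            return t
    return None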

20
LATE Example

[Figure: task timeline at the 2-minute mark]

Node 2: Progress = 66%, estimated time left = (1 – 0.66) / (1/3) = 1 min
Node 3: Progress = 5.3%, estimated time left = (1 – 0.053) / (1/1.9) ≈ 1.8 min

LATE correctly picks Node 3


21
Evaluation

 Environments:
 EC2 (3 job types, 200-250 nodes)
 Small local testbed
 Self-contention through VM placement
 Stragglers through background processes

22
EC2 Sort without Stragglers (Sec 5.2.1)
 106 machines, 7-8 VMs per machine, 243 VMs in total
 128 MB data per host, 30 GB in total
 486 map tasks and 437 reduce tasks
 average 27% speedup over native, 31% over no backups
[Figure: normalized response time (worst, best, average) for No Backups, Hadoop Native, and the LATE Scheduler]
23
EC2 Sort with Stragglers (Sec 5.2.2)
 8 VMs are manually slowed down out of 100 VMs in total
 by running CPU- and disk-intensive jobs in the background
 average 58% speedup over native, 220% over no backups
 93% max speedup over native
[Figure: normalized response time (worst, best, average) for No Backups, Hadoop Native, and the LATE Scheduler]
24
Conclusion

 Heterogeneity is a challenge for parallel apps, and is
growing more important
 Lessons:
 Back up tasks which hurt response time most
 2x improvement using a simple algorithm

25
Summary

 MapReduce is a very powerful and expressive model


 Performance depends a lot on implementation details
 Material is from the paper:
“Improving MapReduce Performance in Heterogeneous Environments”,
by Matei Zaharia, Andy Konwinski, Anthony D. Joseph, Randy Katz and
Ion Stoica, published in the USENIX OSDI conference, 2008
 and their presentation at OSDI

26
