Spark Optimizations & Deployment
Big Data
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
3. Page Rank example
4. Deployment on clusters & clouds
Wide and Narrow transformations
Narrow transformations
• Local computations applied to each partition block
  no communication between processes (or nodes)
  only local dependencies (between parent & child RDDs)
• Examples: map(), filter(), union()
• In case of a sequence of Narrow transformations:
  possible pipelining inside one step
• In case of failure:
  recompute only the damaged partition blocks
  recompute/reload only its parent blocks (Lineage)
Wide and Narrow transformations
Wide transformations
• Computations requiring data from all parent RDD blocks
  many communications between processes (and nodes) (shuffle & sort)
  non‐local dependencies (between parent & child RDDs)
• Examples: groupByKey(), reduceByKey()
• In case of a sequence of transformations:
  no pipelining of transformations
  a wide transformation must be totally achieved before entering the next transformation
• In case of failure:
  recompute the damaged partition blocks
  recompute/reload all blocks of the parent RDDs
Wide and Narrow transformations
Avoiding wide transformations with co‐partitioning
• With identical partitioning of inputs:
wide transformation → narrow transformation
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
• RDD Persistence
• RDD Co‐partitioning
• RDD controlled distribution
• Traffic minimization
• Maintaining parallelism
3. Page Rank example
4. Deployment on clusters & clouds
Optimizations: persistence
Persistence of the RDD
RDDs are stored:
• in the memory space of the Spark Executors
• or on disk (of the node) when memory space of the Executor is full
By default: an old RDD is removed when memory space is required
(Least Recently Used policy)
Spark allows making an RDD « persistent » to avoid recomputing it
(Figure source: Stack Overflow)
Optimizations: persistence
Persistence of the RDD to improve Spark application performance
The Spark application developer has to add instructions to force RDD
storage, and to force RDD forgetting:
myRDD.persist(StorageLevel) // or myRDD.cache()
… // Transformations and Actions
myRDD.unpersist()
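A minimal sketch of this pattern (the RDD name and file path are hypothetical), caching an RDD that is reused by two actions so that it is computed only once:

import org.apache.spark.storage.StorageLevel

// expensive transformation chain, reused by two actions below
val words = sc.textFile("hdfs:///data/corpus.txt")   // hypothetical input path
              .flatMap(_.split("\\s+"))
words.persist(StorageLevel.MEMORY_ONLY)   // or words.cache()

val nbWords    = words.count()             // 1st action: computes and caches 'words'
val nbDistinct = words.distinct().count()  // 2nd action: reuses the cached blocks

words.unpersist()                          // free the Executors' memory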
Optimizations: persistence
Persistence of the RDD to improve fault tolerance
To face short‐term failures, the Spark application developer can force
RDD storage with replication in the local memory/disk of several
Spark Executors:
myRDD.persist(StorageLevel.MEMORY_AND_DISK_SER_2)
… // Transformations and Actions
myRDD.unpersist()
Longer, but secure!
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
• RDD Persistence
• RDD Co‐partitioning
• RDD controlled distribution
• Traffic minimization
• Maintaining parallelism
3. Page Rank example
4. Deployment on clusters & clouds
Optimizations: RDD co‐partitioning
5 main internal properties of an RDD:
• A list of partition blocks
  getPartitions()
• A function for computing each partition block
  compute(…)
  → to compute and re‐compute the RDD (from its parent RDDs) when a failure happens
• A list of dependencies on other RDDs: parent RDDs and transformations to apply
  getDependencies()
Optionally:
• A Partitioner for key‐value RDDs: metadata specifying the RDD partitioning
  partitioner()
  → to control the RDD partitioning, to achieve co‐partitioning…
• A list of nodes where each partition block can be accessed faster due to data locality
  getPreferredLocations(…)
  → to improve data locality with HDFS & YARN…
Optimizations: RDD co‐partitioning
Specify a « partitioner »
Create a new RDD (rdd2):
• partitioned according to a hash‐partitioner strategy
• into 100 partition blocks (spread over the Spark Executors)
Redistributing the RDD (rdd1 → rdd2) is a WIDE (expensive) transformation
• Do not keep the original partitioning (rdd1) in memory / on disk
• Keep the new partitioning (rdd2) in memory / on disk,
  to avoid repeating the WIDE transformation when rdd2 is re‐used
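A plausible reconstruction of the code this slide describes (rdd1 and rdd2 are hypothetical names for an existing key‐value RDD and its re‐partitioned copy):

import org.apache.spark.HashPartitioner

val rdd1 = sc.parallelize(Seq(("a", 1), ("b", 2), ("c", 3)))   // hypothetical key-value RDD
val rdd2 = rdd1.partitionBy(new HashPartitioner(100))          // WIDE: full shuffle into 100 partition blocks
               .persist()                                      // keep the co-partitioned result in memory/disk
// later key-based ops on rdd2 (join, reduceByKey…) can reuse this partitioning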
Optimizations: RDD co‐partitioning
Avoid repetitive WIDE transformations on large data sets
(Figure: without an explicit partitioner, each repeated A.join(B) is a Wide op; re‐partitioning A one time with the same partitioner as B, used on the same set of keys, makes the repeated joins Narrow)
• Make ONE Wide op (one time) to avoid many Wide ops
• An explicit partitioning « propagates » to the transformation result
• Replace Wide ops by Narrow ops
• Do not re‐partition an RDD that is used only once!
  (the re‐partition is itself a Wide op, so nothing is gained)
Optimizations: RDD co‐partitioning
Co‐partitioning
• Use the same partitioner
• Avoid repeating the Wide op
(Figure: when only A is re‐partitioned, the repeated A’.join(B) is still Wide on the B side; when B is created with the right partitioning, A → A’ is Wide one time and the repeated A’.join(B) becomes Narrow on both sides)
Optimizations: RDD co‐partitioning
PageRank with partitioner (see further)
val links = …… // previous code
val links1 = links.partitionBy(new HashPartitioner(100)).persist()
• Pb: flatMap{… urlLinks.map(…)} can change the partitioning?!
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
• RDD Persistence
• RDD Co‐partitioning
• RDD controlled distribution
• Traffic minimization
• Maintaining parallelism
3. Page Rank example
4. Deployment on clusters & clouds
Optimization: RDD distribution
Create and distribute an RDD
• By default: level of parallelism set by the nb of partition blocks
of the input RDD
• When the input is an in‐memory collection (list, array…), it needs
to be parallelized:
val theData = List(("a",1), ("b",2), ("c",3),……)
sc.parallelize(theData).theTransformation(…)
Or :
val theData = List(1,2,3,……).par
theData.theTransformation(…)
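Note that sc.parallelize() also accepts an explicit number of partition blocks; a minimal sketch (the data and the value 8 are just example assumptions):

val theData = List(("a",1), ("b",2), ("c",3))
val rdd = sc.parallelize(theData, 8)  // force 8 partition blocks
println(rdd.getNumPartitions)         // prints 8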
Optimization: RDD distribution
Control of the RDD distribution
• Most transformations support an extra parameter to control
the distribution (and the parallelism)
• Example:
Default parallelism:
val theData = List(("a",1), ("b",2), ("c",3),……)
sc.parallelize(theData).reduceByKey((x,y) => x+y)
Tuned parallelism:
val theData = List(("a",1), ("b",2), ("c",3),……)
sc.parallelize(theData).reduceByKey((x,y) => x+y,8)
8 partition blocks imposed for
the result of the reduceByKey
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
• RDD Persistence
• RDD Co‐partitioning
• RDD controlled distribution
• Traffic minimization
• Maintaining parallelism
3. Page Rank example
4. Deployment on clusters & clouds
Optimization: traffic minimization
RDD redistribution: rdd: {(1, 2), (3, 3), (3, 4)}
Scala: rdd.groupByKey() → rdd: {(1, [2]), (3, [3, 4])}
Groups the values associated to the same key
Almost all the input data moves across the network during the shuffle step
Huge traffic in the shuffle step!!
Optimization: traffic minimization
RDD reduction: rdd: {(1, 2), (3, 3), (3, 4)}
Scala: rdd.reduceByKey((x,y) => x+y) → rdd: {(1, 2), (3, 7)}
Reduces the values associated to the same key: partial reductions are
computed locally inside each partition block before the shuffle,
so far less data moves than with groupByKey()
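As an illustration (a minimal sketch on the toy data above), both lines compute the same per‐key sums, but the reduceByKey version shuffles far less data:

val rdd = sc.parallelize(Seq((1, 2), (3, 3), (3, 4)))
val sums1 = rdd.groupByKey().mapValues(_.sum)  // moves every (key, value) pair across the network
val sums2 = rdd.reduceByKey(_ + _)             // pre-reduces locally, then shuffles one value per key per block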
Optimization: traffic minimization
RDD reduction with different input and reduced datatypes:
Scala : rdd.aggregateByKey(init_acc)(
…, // mergeValueAccumulator fct
…, // mergeAccumulators fct
)
Scala : rdd.combineByKey(
…, // createAccumulator fct
…, // mergeValueAccumulator fct
…, // mergeAccumulators fct
)
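A minimal sketch of aggregateByKey with an accumulator type different from the value type (the data is a hypothetical example): the values are Ints, the accumulator is a (sum, count) pair:

val marks = sc.parallelize(Seq(("julie", 12), ("marc", 10), ("julie", 15)))
val sumCount = marks.aggregateByKey((0, 0))(
  (acc, v) => (acc._1 + v, acc._2 + 1),    // mergeValueAccumulator: add one value into an accumulator
  (a, b)   => (a._1 + b._1, a._2 + b._2)   // mergeAccumulators: merge two partial accumulators
)
// sumCount: {("julie", (27, 2)), ("marc", (10, 1))}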
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
• RDD Persistence
• RDD Co‐partitioning
• RDD controlled distribution
• Traffic minimization
• Maintaining parallelism
3. Page Rank example
4. Deployment on clusters & clouds
Optimization: maintaining parallelism
Computing an average value per key in parallel
theMarks: {("julie", 12), ("marc", 10), ("albert", 19), ("julie", 15), ("albert", 15), …}
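A minimal sketch of one way to compute the average per key while keeping the computation fully distributed (the exact code on the original slides may differ): reduce (sum, count) pairs per key, then divide, with no collect() and no groupByKey():

val theMarks = sc.parallelize(Seq(("julie", 12.0), ("marc", 10.0), ("albert", 19.0),
                                  ("julie", 15.0), ("albert", 15.0)))
val avgPerKey = theMarks.map { case (k, v) => (k, (v, 1)) }
                        .reduceByKey { case ((s1, c1), (s2, c2)) => (s1 + s2, c1 + c2) }
                        .mapValues { case (sum, count) => sum / count }
// avgPerKey: {("julie", 13.5), ("marc", 10.0), ("albert", 17.0)}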
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
3. Page Rank example
4. Deployment on clusters & clouds
• Task DAG execution
• Spark execution on clusters
• Ex of Spark execution on cloud
PageRank with Spark
PageRank objectives
Compute the probability to arrive at a web page when randomly clicking on web links…
(Figure: a small web graph with url 1 … url 4; url 1 is an important URL, referenced by many pages; url 4’s rank increases because it is referenced by an important URL)
• If a URL is referenced by many other URLs then its rank increases
(because being referenced means that it is important – ex: URL 1)
• If an important URL (like URL 1) references other URLs (like URL 4)
this will increase the destination’s ranking
PageRank with Spark
PageRank principles
• Simplified algorithm:

  $PR(u) = \sum_{v \in B(u)} \frac{PR(v)}{L(v)}$

  $B(u)$: the set containing all pages linking to page u
  $PR(x)$: PageRank of page x
  $L(v)$: the number of outbound links of page v
  $PR(v)/L(v)$: contribution of page v to the rank of page u
• Iterate k times:
  compute the PR of each page
PageRank with Spark
PageRank principles
• The damping factor:
the probability that a user continues to click is a damping factor d (usually d = 0.85)

  $PR(u) = \frac{1-d}{N} + d \cdot \sum_{v \in B(u)} \frac{PR(v)}{L(v)}$

  $N$: number of documents in the collection
  The sum of all PR values is 1
• Variant:

  $PR(u) = (1-d) + d \cdot \sum_{v \in B(u)} \frac{PR(v)}{L(v)}$

  Usually: d = 0.85
PageRank with Spark
PageRank first step in Spark (Scala)
// read text file into Dataset[String] -> RDD1
val lines = spark.read.textFile(args(0)).rdd
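The later steps use a pair RDD links <url, Iterable of outbound urls>. A plausible way to build it from lines, following the standard Spark PageRank example (this assumes each input line holds a « source destination » URL pair separated by whitespace):

val links = lines.map { s =>
    val parts = s.split("\\s+")
    (parts(0), parts(1))       // (source url, destination url)
  }
  .distinct()                  // drop duplicated edges
  .groupByKey()                // url -> Iterable of outbound links
  .cache()                     // reused at every iteration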
PageRank with Spark
PageRank second step in Spark (Scala)
Initialization with the 1/N equi‐probability (N = 4 for the example graph url 1 … url 4):
// links <key, Iter> RDD → ranks <key, 1.0/Npages> RDD
var ranks = links.mapValues(v => 1.0/4.0)
links.mapValues(…) is an immutable RDD
var ranks is a mutable variable:
  var ranks = RDD1
  ranks = RDD2
« ranks » is re‐associated to a new RDD; RDD1 is forgotten…
…and will be removed from memory
Other strategy (initialize every rank to 1.0):
// links <key, Iter> RDD → ranks <key, one> RDD
var ranks = links.mapValues(v => 1.0)

links RDD:                    ranks RDD:
url 4 → [url 3, url 1]        url 4 → 1.0
url 3 → [url 2, url 1]        url 3 → 1.0
url 2 → [url 1]               url 2 → 1.0
url 1 → [url 4]               url 1 → 1.0
PageRank with Spark
PageRank third step in Spark (Scala)
for (i <- 1 to iters) {
  val contribs =
    links.join(ranks)
         .flatMap { case (url, (urlLinks, rank)) =>
           urlLinks.map(dest => (dest, rank / urlLinks.size)) }
  ranks = contribs.reduceByKey(_ + _)
                  .mapValues(0.15 + 0.85 * _)
}

Dataflow of one iteration on the example graph:
links RDD: url 4 → [url 3, url 1]; url 3 → [url 2, url 1]; url 2 → [url 1]; url 1 → [url 4]
ranks RDD: url 4 → 1.0; url 3 → 1.0; url 2 → 1.0; url 1 → 1.0
.join (output links & ranks): url 4 → ([url 3, url 1], 1.0); url 3 → ([url 2, url 1], 1.0); url 2 → ([url 1], 1.0); url 1 → ([url 4], 1.0)
.flatMap (individual input contributions), contribs RDD: url 3 → 0.5; url 1 → 0.5; url 2 → 0.5; url 1 → 0.5; url 1 → 1.0; url 4 → 1.0
.reduceByKey (cumulated input contributions): url 4 → 1.0; url 1 → 2.0; url 3 → 0.5; url 2 → 0.5
.mapValues (with the damping factor), new ranks RDD: url 4 → 1.0; url 1 → 1.849; url 3 → 0.57; url 2 → 0.57
PageRank with Spark
PageRank third step in Spark (Scala)
• Spark & Scala allow a short/compact implementation of the
PageRank algorithm
• Each RDD remains in‐memory from one iteration to the next
PageRank with Spark
PageRank third step in Spark (Scala): optimized with partitioner
val links = …… // previous code
val links1 = links.partitionBy(new HashPartitioner(100)).persist()
• Pb: flatMap{… urlLinks.map(…)} can change the partitioning?!
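A sketch of how the iteration can exploit this partitioner (an assumption based on the standard optimized PageRank example, reusing the names above): links1 is already hash‐partitioned and persisted, so join(ranks) does not re‐shuffle the large links1 RDD at every iteration; the flatMap output, however, loses the partitioner, so contribs still has to be shuffled by reduceByKey:

var ranks = links1.mapValues(v => 1.0)                    // mapValues preserves links1's partitioner
for (i <- 1 to iters) {
  val contribs = links1.join(ranks)                       // links1 side: no re-shuffle (already partitioned & persisted)
    .flatMap { case (url, (urlLinks, rank)) =>            // flatMap may change the keys,
      urlLinks.map(dest => (dest, rank / urlLinks.size)) }// so Spark drops the partitioner here
  ranks = contribs.reduceByKey(_ + _)                     // wide: contribs has to be shuffled again
                  .mapValues(0.15 + 0.85 * _)
}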
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
3. Page Rank example
4. Deployment on clusters & clouds
• Task DAG execution
• Spark execution on clusters
• Ex of Spark execution on cloud
Task DAG execution
• A RDD is a dataset distributed among the Spark compute nodes
• Transformations are lazy operations: saved and executed later
• Actions trigger the execution of the sequence of transformations
(Figure: a DAG of RDDs and transformations; the final Action triggers the execution and produces the Result)
Task DAG execution
The Spark application driver controls the application run
• It creates the Spark context
• It analyses the Spark program
Task DAG execution
Spark job trace: on 10 Spark executors, with a 3GB input file

DAGScheduler: Submitting 24 missing tasks from ShuffleMapStage 0 ...
TaskSchedulerImpl: Adding task set 0.0 with 24 tasks
...
TaskSetManager: Starting task 1.0 in stage 0.0 (TID 1, 172.20.10.14, executor 0, partition 1, ...)
TaskSetManager: Starting task 2.0 in stage 0.0 (TID 2, 172.20.10.11, executor 7, partition 2, ...)
...
TaskSetManager: Starting task 10.0 in stage 0.0 (TID 10, 172.20.10.11, executor 7, partition 10, ...)
TaskSetManager: Finished task 2.0 in stage 0.0 (TID 2) in 18274 ms … (executor 7) (1/24)
TaskSetManager: Starting task 11.0 in stage 0.0 (TID 11, 172.20.10.7, executor 8, partition 11, ...)
TaskSetManager: Finished task 8.0 in stage 0.0 (TID 8) in 18459 ms … (executor 8) (2/24)
...
TaskSchedulerImpl: Removed TaskSet 0.0, whose tasks have all completed, from pool
...

• The first 10 tasks are submitted on the 10 Spark executor processes
• A new task is submitted as soon as a previous one has finished
• Removing the TaskSet marks the end of the task graph execution
Task DAG execution
Execution time as a function of the number of Spark executors
Ex. of Spark application run:
• from 1 up to 15 executors
• with 1 executor per node
(Figure: « Spark pgm run on 1-15 nodes » — Exec Time(s), log scale 32–512, vs Nb of nodes, 1–16; example of a graph of 4 parallel tasks)
Good overall decrease, but plateaus appear!
Probable load balancing problem…
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
3. Page Rank example
4. Deployment on clusters & clouds
• Task DAG execution
• Spark execution on clusters
• Using the Spark cluster manager (standalone mode)
• Using YARN as cluster manager
• Using Mesos as cluster manager
• Ex of Spark execution on cloud
Using the Spark Master as cluster
manager (standalone mode)
spark-submit --master spark://node:port … myApp
Spark cluster configuration:
• Add the list of cluster worker nodes in the Spark Master config.
• Specify the maximum amount of memory per Spark Executor
spark-submit --executor-memory XX …
• Specify the total amount of CPU cores used to process one
Spark application (through all its Spark executors)
spark-submit --total-executor-cores YY …
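Putting these flags together (the host name, class, jar name and sizing values below are hypothetical):

spark-submit --master spark://master-node:7077 \
             --executor-memory 4G \
             --total-executor-cores 32 \
             --class bigdata.MyApp myApp.jar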
Using the Spark Master as cluster
manager (standalone mode)
spark-submit --master spark://node:port … myApp
Spark cluster configuration:
• Default config:
− (only) 1GB per Spark Executor
− Unlimited nb of CPU cores per application execution
− The Spark Master creates one multi‐core Executor on each Worker
node to process each job (invading all the cores!)
• You can limit the total nb of cores per job
• You can concentrate the cores into a few multi‐core Executors
Using the Spark Master as cluster
manager (standalone mode)
spark-submit --master spark://node:port … myApp
Using the Spark Master as cluster
manager (standalone mode)
spark-submit --master spark://node:port … myApp
(Figure: Spark executors running on the cluster worker nodes)
The laptop connection can be turned off: production mode
Using the Spark Master as cluster
manager (standalone mode)
spark-submit --master spark://node:port … myApp
(Figure: HDFS Name Node; cluster worker nodes also acting as Hadoop Data Nodes)
Using the Spark Master as cluster
manager (standalone mode)
spark-submit --master spark://node:port … myApp
Cluster deployment mode:
the Spark app. Driver runs inside the cluster (launched through the Spark Master / Cluster Manager) instead of on the client:
• DAG builder
• DAG scheduler‐optimizer
• Task scheduler
Using the Spark Master as cluster
manager (standalone mode)
spark-submit --master spark://node:port … myApp
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
3. Page Rank example
4. Deployment on clusters & clouds
• Task DAG execution
• Spark execution on clusters
• Using the Spark cluster manager (standalone mode)
• Using YARN as cluster manager
• Using Mesos as cluster manager
• Ex of Spark execution on cloud
Using YARN as cluster manager
(Figure: YARN cluster — HDFS Name Node; cluster worker nodes also acting as Hadoop Data Nodes)
Spark cluster configuration:
• Add an env. variable defining the path to Hadoop conf directory
• Specify the maximum amount of memory per Spark Executor
• Specify the amount of CPU cores used per Spark executor
spark-submit --executor-cores YY …
• Specify the nb of Spark Executors per job: --num-executors
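Putting these options together (the HADOOP_CONF_DIR path, class, jar name and sizing values below are hypothetical; --master yarn selects YARN as the cluster manager):

export HADOOP_CONF_DIR=/etc/hadoop/conf   # env. variable giving the path to the Hadoop conf directory
spark-submit --master yarn \
             --num-executors 10 \
             --executor-cores 4 \
             --executor-memory 4G \
             --class bigdata.MyApp myApp.jar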
Using YARN as cluster manager
Spark cluster configuration:
• By default:
− (only) 1GB/Spark Executor
− (only) 1 CPU core per Spark Executor
− (only) 2 Spark Executors per job
• Usually better with few large Executors (RAM & nb of cores)…
Using YARN as cluster manager
Spark cluster configuration:
• Link the Spark RDD « preferred locations » meta‐data to the HDFS
meta‐data about the « localization of the input file blocks »,
at Spark Context construction:
val sc = new SparkContext(sparkConf,
  InputFormatInfo.computePreferredLocations(
    Seq(new InputFormatInfo(conf,
      classOf[org.apache.hadoop.mapred.TextInputFormat], hdfspath))…
Using YARN as cluster manager
Client deployment mode:
• YARN Resource Manager = Cluster Manager
• App. Master = « Executor » launcher: it launches the Spark executors on the worker nodes
• The Spark Driver runs on the client side:
  • DAG builder
  • DAG scheduler‐optimizer
  • Task scheduler
• HDFS Name Node
Using YARN as cluster manager
Cluster deployment mode:
• YARN Resource Manager = Cluster Manager
• The App. Master hosts the Spark Driver:
  • DAG builder
  • DAG scheduler‐optimizer
  • Task scheduler
• Spark executors run on the worker nodes
• HDFS Name Node
YARN vs standalone Spark Master:
• Usually available on HADOOP/HDFS clusters
• Allows running Spark and other kinds of applications on HDFS data
(better for sharing a Hadoop cluster)
• Advanced application scheduling mechanisms
(multiple queues, managing priorities…)
YARN vs standalone Spark Master:
• Improvement of the data‐computation locality…but is it critical ?
− Spark reads/writes only input/output RDD from Disk/HDFS
− Spark keeps intermediate RDD in‐memory
− With cheap disks: disk‐IO time > network time
Better to deploy many Executors on unloaded nodes ?
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
3. Page Rank example
4. Deployment on clusters & clouds
• Task DAG execution
• Spark execution on clusters
• Using the Spark cluster manager (standalone mode)
• Using YARN as cluster manager
• Using Mesos as cluster manager
• Ex of Spark execution on cloud
Using Mesos as cluster manager
Client deployment mode:
• The Mesos Master is the Cluster Manager
With just Mesos:
• No Application Master
• No Input Data – Executor locality
Using Mesos as cluster manager
Cluster deployment mode:
• The Mesos Master is the Cluster Manager
• The Spark Driver runs inside the cluster:
  • DAG builder
  • DAG scheduler‐optimizer
  • Task scheduler
• HDFS Name Node
Spark optimizations & deployment
1. Wide and Narrow transformations
2. Optimizations
3. Page Rank example
4. Deployment on clusters & clouds
• Task DAG execution
• Spark execution on clusters
• Ex of Spark execution on cloud
Ex. of Spark execution on cloud
(Figure: a standalone Spark Master deployed on the cloud cluster « MyCluster‐1 »; the Spark app. Driver — DAG builder, DAG scheduler‐optimizer, Task scheduler; an HDFS Name Node)