Spark tuning

Post on 06-Jan-2017


Spark Tuning: Q4’s Research Report

linhtm@runsystem.net

Agenda

1. Tuning Spark parameters
   a. Control Spark’s resource usage
   b. Advanced parameters
   c. Dynamic Allocation

2. Tips for tuning your Spark program

3. Example use case of tuning a Spark algorithm


Tuning Spark Parameter


The easy way

If your Spark application is slow, just give it more system resources.

Is there anything simpler?


Spark Architecture Simplified


Control Spark’s resource usage

• spark-submit command parameters (some are only available when running on YARN)


Parameter / Description / Default value:

• --num-executors: Number of executors to launch (default: 2)

• --executor-cores: Number of cores per executor (default: 1)

• --executor-memory: Memory per executor (default: 1G)

• --driver-cores: Number of cores used by the driver, only in YARN cluster mode (default: 1)

• --driver-memory: Memory for the driver (default: 1G)

Calculate the right values

• Example: 4 servers for Spark, each with 64 GB RAM and 16 cores. How should we set the spark-submit parameters?
  - --num-executors 4 --executor-memory 63g --executor-cores 15
  - --num-executors 7 --executor-memory 29g --executor-cores 7
  - --num-executors 11 --executor-memory 19g --executor-cores 5
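The first configuration above can be sketched as a simple calculation (the function name and the exact reservation of one core and 1 GB per node for the OS and Hadoop daemons are illustrative assumptions, not from the slides):

```python
def executor_layout(nodes, cores_per_node, mem_gb_per_node,
                    executors_per_node=1, reserved_cores=1, reserved_gb=1):
    """Split each node's resources (minus a reservation for the OS and
    Hadoop daemons) evenly among the executors placed on that node."""
    usable_cores = cores_per_node - reserved_cores
    usable_gb = mem_gb_per_node - reserved_gb
    return {
        "num-executors": nodes * executors_per_node,
        "executor-cores": usable_cores // executors_per_node,
        "executor-memory": f"{usable_gb // executors_per_node}g",
    }

# 4 servers, 16 cores and 64 GB each -> the first option above
print(executor_layout(nodes=4, cores_per_node=16, mem_gb_per_node=64))
# {'num-executors': 4, 'executor-cores': 15, 'executor-memory': '63g'}
```

The other two options additionally leave room for the YARN ApplicationMaster and memory overhead, so their numbers do not come straight out of this formula.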

Spark Executor’s Memory Model

• Memory requested from YARN for each container = spark.executor.memory + spark.yarn.executor.memoryOverhead

• spark.yarn.executor.memoryOverhead = max(spark.executor.memory * 0.10, 384 MB)
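A quick sanity check of that formula in plain Python (the function name is made up for illustration):

```python
def yarn_container_request_mb(executor_memory_mb):
    """Total memory YARN must grant per container:
    spark.executor.memory plus the default memoryOverhead,
    which is max(10% of executor memory, 384 MB)."""
    overhead_mb = max(int(executor_memory_mb * 0.10), 384)
    return executor_memory_mb + overhead_mb

print(yarn_container_request_mb(1024))   # 1 GB executor -> 1408 (384 MB floor wins)
print(yarn_container_request_mb(8192))   # 8 GB executor -> 9011 (10% rule wins)
```

This is why asking for --executor-memory 63g actually requests roughly 63 GB + 6.3 GB per container from YARN, so the sizing on the previous slide must leave headroom on each node.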


More advanced parameters


• spark.shuffle.memoryFraction: Fraction of Java heap to use for aggregation and cogroups during shuffles (default: 0.2)

• spark.reducer.maxSizeInFlight: Maximum size of map outputs to fetch simultaneously from each reduce task (default: 48m)

• spark.shuffle.consolidateFiles: If set to "true", consolidates intermediate files created during a shuffle (default: false)

• spark.shuffle.file.buffer: Size of the in-memory buffer for each shuffle file output stream (default: 32k)

• spark.storage.memoryFraction: Fraction of Java heap to use for Spark's memory cache (default: 0.6)

• spark.akka.frameSize: Maximum message size to allow in "control plane" communication (for serialized tasks and task results), in MB (default: 10)

• spark.akka.threads: Number of actor threads to use for communication (default: 4)

Advanced Spark memory


Demo Spark UI


Using Dynamic Allocation

• Dynamically scales the set of cluster resources allocated to your application up and down based on the workload

• Only available when using YARN as the cluster manager

• Requires an external shuffle service, so a shuffle service must be configured in YARN

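A minimal spark-defaults.conf sketch for turning this on (assuming YARN and its external shuffle service are already set up; the min/max values here are illustrative, not recommendations from the slides):

```
spark.shuffle.service.enabled                true
spark.dynamicAllocation.enabled              true
spark.dynamicAllocation.minExecutors         2
spark.dynamicAllocation.maxExecutors         20
spark.dynamicAllocation.executorIdleTimeout  60s
```

The individual parameters are described on the next two slides.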

Dynamic Allocation parameters (1)


• spark.shuffle.service.enabled: Enables the external shuffle service, which preserves the shuffle files written by executors so the executors can be safely removed (default: false)

• spark.dynamicAllocation.enabled: Whether to use dynamic resource allocation (default: false)

• spark.dynamicAllocation.executorIdleTimeout: If an executor has been idle for more than this duration, it will be removed (default: 60s)

• spark.dynamicAllocation.cachedExecutorIdleTimeout: If an executor with cached data blocks has been idle for more than this duration, it will be removed (default: infinity)

• spark.dynamicAllocation.initialExecutors: Initial number of executors to run (default: minExecutors)

Dynamic Allocation parameters (2)


• spark.dynamicAllocation.maxExecutors: Upper bound for the number of executors (default: infinity)

• spark.dynamicAllocation.minExecutors: Lower bound for the number of executors (default: 0)

• spark.dynamicAllocation.schedulerBacklogTimeout: If there have been pending tasks backlogged for more than this duration, new executors will be requested (default: 1s)

• spark.dynamicAllocation.sustainedSchedulerBacklogTimeout: Same as schedulerBacklogTimeout, but used only for subsequent executor requests (default: schedulerBacklogTimeout)

Dynamic Allocation in Action


Dynamic Allocation - The verdict

• Dynamic Allocation helps you use your cluster resources more efficiently

• But it is only effective when the Spark application is long-running, with long stages that need different numbers of tasks (Spark Streaming?)

• In addition, when an executor is removed, all of its cached data is no longer accessible


Tips for Tuning Your Spark Program


Tuning Memory Usage

• Prefer arrays of objects and primitive types over the standard Java or Scala collection classes (e.g. HashMap).

• Avoid nested structures with many small objects and pointers when possible.

• Use numeric IDs or enumeration objects instead of strings for keys.

• If you have less than 32 GB of RAM, set the JVM flag -XX:+UseCompressedOops to make pointers four bytes instead of eight.
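As a sketch of how that JVM flag could be passed to Spark (the surrounding spark-submit command is illustrative; only the two extraJavaOptions properties are the point here):

```
spark-submit \
  --conf "spark.executor.extraJavaOptions=-XX:+UseCompressedOops" \
  --conf "spark.driver.extraJavaOptions=-XX:+UseCompressedOops" \
  ...
```

Note that recent JVMs already enable compressed oops by default for heaps under about 32 GB, so the flag mainly matters on older JVMs.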


Other Tuning Tips (1)

● Use KryoSerializer instead of the default JavaSerializer
● Know when to persist an RDD, and choose the right storage level
  ○ MEMORY_ONLY
  ○ MEMORY_AND_DISK
  ○ MEMORY_ONLY_SER
  ○ …
● Choose the right level of parallelism
  ○ spark.default.parallelism
  ○ repartition
  ○ the 2nd argument of methods in spark.PairRDDFunctions
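The serializer and parallelism tips can both be set cluster-wide in spark-defaults.conf; a hedged sketch (the value 128 is illustrative; a common rule of thumb is 2-3 tasks per CPU core in the cluster):

```
spark.serializer           org.apache.spark.serializer.KryoSerializer
spark.default.parallelism  128
```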

Other tuning tips (2)

• Broadcast large variables

• Do not collect() on large RDDs (filter first)

• Be careful with operations that require a data shuffle (join, reduceByKey, groupByKey, …)

• Avoid groupByKey; use reduceByKey, aggregateByKey, or combineByKey (low level) if possible.

groupByKey vs reduceByKey (1)


groupByKey vs reduceByKey (2)

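The comparison on these two slides is a diagram; the idea can be simulated in plain Python (no Spark needed; function names and data are illustrative). reduceByKey runs a map-side combine, collapsing each partition to one record per key before the shuffle, while groupByKey ships every record across the network:

```python
from collections import defaultdict

def shuffle_with_group_by_key(partitions):
    """groupByKey-style: every (key, value) record is shuffled,
    then values are aggregated on the reduce side."""
    shuffled = [kv for part in partitions for kv in part]
    totals = defaultdict(int)
    for k, v in shuffled:
        totals[k] += v
    return dict(totals), len(shuffled)   # (result, records shuffled)

def shuffle_with_reduce_by_key(partitions):
    """reduceByKey-style: a map-side combine collapses each partition
    to one record per key before anything crosses the network."""
    shuffled = []
    for part in partitions:
        combined = defaultdict(int)
        for k, v in part:
            combined[k] += v
        shuffled.extend(combined.items())
    totals = defaultdict(int)
    for k, v in shuffled:
        totals[k] += v
    return dict(totals), len(shuffled)

parts = [[("a", 1), ("a", 1), ("b", 1)],
         [("a", 1), ("b", 1), ("b", 1)]]
print(shuffle_with_group_by_key(parts))   # ({'a': 3, 'b': 3}, 6)
print(shuffle_with_reduce_by_key(parts))  # ({'a': 3, 'b': 3}, 4)
```

Both produce the same totals, but the reduceByKey-style path shuffles 4 records instead of 6; on real data with many duplicate keys the gap is far larger.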

Example use case of tuning Spark algorithm


Tuning CF algorithm in RW project

• 1st algorithm, no parameter tuning: 27 min

• 1st algorithm, parameters tuned: 18 min

• 2nd algorithm (from Spark code), parameters tuned: ~7 min 30 s

• 3rd algorithm (improved Spark code), parameters tuned: ~6 min 30 s


Q&A


Thank You!