
Spark on Hadoop YARN - executor missing

I have a cluster of 3 macOS machines running Hadoop and Spark 1.5.2 (the same problem also exists with Spark 2.0.0). With 'yarn' as the Spark master URL, I am running into a strange issue where tasks are only allocated to 2 of the 3 machines.

Based on the Hadoop dashboard (port 8088 on the master) it is clear that all 3 nodes are part of the cluster. However, any Spark job I run only uses 2 executors.

For example, here is the "Executors" tab from a lengthy run of the JavaWordCount example: [screenshot of the Executors tab]. "batservers" is the master. There should be an additional slave, "batservers2", but it's just not there.

Why might this be?

Note that none of my YARN or Spark (or, for that matter, HDFS) configurations are unusual, apart from provisions for giving the YARN ResourceManager and NodeManagers extra memory.

Remarkably, all it took was a detailed look at the spark-submit help message to discover the answer:

YARN-only:
  ...
  --num-executors NUM         Number of executors to launch (Default: 2).

If I specify --num-executors 3 in my spark-submit command, the 3rd node is used.
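
For reference, a minimal spark-submit invocation along these lines might look like the sketch below. The class name is the stock JavaWordCount example that ships with Spark; the jar path, input path, and memory/core settings are placeholders that would need to be adapted to the actual cluster.

  # request one executor per worker node; without this flag only the default 2 are launched
  spark-submit \
    --master yarn \
    --deploy-mode client \
    --class org.apache.spark.examples.JavaWordCount \
    --num-executors 3 \
    --executor-memory 1g \
    --executor-cores 1 \
    /path/to/spark-examples.jar \
    hdfs:///path/to/input.txt

The --num-executors flag corresponds to the spark.executor.instances property, so the same setting can also be made persistent in spark-defaults.conf instead of passing it on every submit.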
