
Standalone Cluster Mode: how does spark allocate spark.executor.cores?

I'm searching for how and where Spark allocates cores per executor in the source code. Is it possible to control programmatically allocated cores in standalone cluster mode?

Regards, Matteo

Spark allows configuration options to be passed through the .set method on the SparkConf class.

Here's some Scala code that sets up a new Spark configuration:

new SparkConf()
  .setAppName("App Name")
  .setMaster("local[2]")              // local mode with 2 threads; use a spark://... URL for standalone
  .set("spark.executor.cores", "2")   // cores requested per executor

Documentation about the different configuration options:

http://spark.apache.org/docs/1.6.1/configuration.html#execution-behavior

I haven't looked through the source code exhaustively, but I think this is the spot in the source code where the executor cores are defined prior to allocation:

https://github.com/apache/spark/blob/d6dc12ef0146ae409834c78737c116050961f350/core/src/main/scala/org/apache/spark/scheduler/cluster/ExecutorData.scala
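
For orientation only, here is a rough, hypothetical sketch of the kind of per-executor bookkeeping that class holds; the names below are illustrative, not the actual fields in the linked file:

// Hypothetical, simplified view of the per-executor state the scheduler backend tracks.
// See the linked ExecutorData.scala for the real definition.
class ExecutorBookkeeping(
    val executorHost: String,
    val totalCores: Int,   // cores granted to this executor
    var freeCores: Int     // cores currently free to schedule tasks on
)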

In standalone mode, you have the following options:

a. While starting the cluster, you can specify how many CPU cores to allot to Spark applications. This can be set either as the environment variable SPARK_WORKER_CORES or passed as an argument to the worker start script (-c or --cores).

b. Care should be taken (if other applications also share resources like cores) not to allow Spark to take all the cores. This can be set using the spark.cores.max parameter.

c. You can also pass --total-executor-cores <numCores> to the spark shell (see the sketch after this list).
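
A minimal sketch of option (b) set programmatically, with the other two options noted in comments; the master URL, app name, and values are placeholders, not taken from the original answer:

import org.apache.spark.SparkConf

// Option a is set when starting workers, e.g. via the SPARK_WORKER_CORES
// environment variable or the -c/--cores flag of the worker start script.
// Option c is the --total-executor-cores <numCores> flag of spark-shell/spark-submit.
val cappedConf = new SparkConf()
  .setAppName("Capped App")                // placeholder app name
  .setMaster("spark://master-host:7077")   // placeholder standalone master URL
  .set("spark.cores.max", "8")             // option b: cap on total cores for this application
  .set("spark.executor.cores", "2")        // cores requested per executor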

For more info, you can look here.
