
Spark process on cluster is a Java process

I don't know much about Java. Can anybody explain this line for me?

"Any, any Apache Spark process that would ever work on your cluster or local machine is a JVM process."

Spark is built around the concepts of a driver and executors, where the driver acts as a sort of master and the executors are the workers (slaves). There are two types of operations in Spark: transformations and actions. Only the driver can initiate an operation, but the executors are the ones that actually perform it. So whatever happens during a Spark execution, the driver and the executors are the only ones responsible. And the driver and executors are nothing but JVM processes (on YARN, each one runs as a JVM inside a container). So every operation is executed by JVM processes only (the driver and the executors). Hope this answers your question.
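If "JVM process" itself is the unclear part, here is a minimal sketch in plain Java. It is not Spark code and uses only the JDK; the class name `JvmProcessInfo` and the method `describe` are made up for illustration. It just shows that a running Java program is an ordinary operating-system process (with a pid) that hosts a Java Virtual Machine. A Spark driver or executor is the same kind of thing, only running Spark's classes instead of this one.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.RuntimeMXBean;

// Hypothetical example class, not part of Spark.
public class JvmProcessInfo {

    // Builds a one-line description of the JVM process this code is running in.
    static String describe() {
        RuntimeMXBean runtime = ManagementFactory.getRuntimeMXBean();
        long pid = ProcessHandle.current().pid(); // OS process id (requires Java 9+)
        return "pid=" + pid
                + " vm=" + runtime.getVmName()
                + " java=" + System.getProperty("java.version");
    }

    public static void main(String[] args) {
        // Every JVM process on the machine would print its own pid here.
        System.out.println(describe());
    }
}
```

On a machine where a Spark application is running, the JDK's `jps` tool should list the driver and executor JVMs in the same way it would list this program while it runs.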


 