
YARN as resource manager in Spark for a Linux cluster - inside Kubernetes and outside Kubernetes

If I am using a Kubernetes cluster to run Spark, then I am using the Kubernetes resource manager in Spark.

If I am using a Hadoop cluster to run Spark, then I am using the YARN resource manager in Spark.

But my question is: if I am spawning multiple Linux nodes in Kubernetes, and using one of the nodes as the Spark master and the three others as workers, what resource manager should I use? Can I use YARN here?

Second question: in the case of any 4-node Linux Spark cluster (not in Kubernetes and not Hadoop, just simple connected Linux machines), even if I do not have HDFS, can I use YARN here as the resource manager? If not, then what resource manager should be used for Spark?

Thanks.

if I am spawning multiple Linux nodes in Kubernetes,

Then you'd obviously use Kubernetes, since it's already available; running YARN inside the pods would just mean layering a second resource manager on top of the one you already have. A sketch of pointing Spark at the Kubernetes API server follows.
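For example, a minimal PySpark sketch in client mode; every value below (API server address, image name, executor count) is a placeholder assumption, not something from the question:

    from pyspark.sql import SparkSession

    # Client mode: the driver runs here and asks the Kubernetes API server to
    # launch executor pods, so the driver must be network-reachable from them
    # (easiest when this script itself runs in a pod in the same cluster).
    spark = (
        SparkSession.builder
        .master("k8s://https://kube-apiserver.example.com:6443")  # placeholder API server
        .appName("spark-on-k8s")
        .config("spark.kubernetes.container.image", "apache/spark:3.5.0")  # placeholder image
        .config("spark.executor.instances", "3")  # one executor per worker node
        .getOrCreate()
    )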

in the case of any 4-node Linux Spark cluster (not in Kubernetes and not Hadoop, just simple connected Linux machines), even if I do not have HDFS, can I use YARN here

You can, or you can use the Spark Standalone scheduler instead. However, Spark requires a shared filesystem for reading and writing data, so while you could use NFS, or S3/GCS, for this, HDFS is faster. A minimal Standalone setup looks like the sketch below.
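As a rough sketch, assuming the master and workers have already been started with Spark's bundled sbin/start-master.sh and sbin/start-worker.sh scripts (Spark 3.x names), and where the hostname and data path are placeholders of my own, not details from the question:

    from pyspark.sql import SparkSession

    # 7077 is the Standalone master's default port; the hostname is hypothetical.
    spark = (
        SparkSession.builder
        .master("spark://master-node.example.com:7077")
        .appName("spark-standalone")
        .getOrCreate()
    )

    # With no HDFS, paths must resolve identically on every node, e.g. an NFS
    # mount shared by all four machines, or s3a:// / gs:// URIs.
    df = spark.read.csv("/mnt/shared/data.csv", header=True)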
