
Getting an error when setting spark.ext.h2o.backend.cluster.mode=external in a PySparkling standalone cluster

Code:

import pandas as pd
from pyspark.sql import SparkSession
from pysparkling import *
import h2o
from pysparkling.ml import H2OAutoML
spark = SparkSession.builder.appName('SparkApplication').getOrCreate()
hc = H2OContext.getOrCreate()

Spark-submit command:

spark-submit --master spark://local:7077 \
  --py-files sparkling-water-3.36.1.3-1-3.2/py/h2o_pysparkling_3.2-3.36.1.3-1-3.2.zip \
  --conf "spark.ext.h2o.backend.cluster.mode=external" \
  --conf "spark.ext.h2o.external.start.mode=auto" \
  --conf "spark.ext.h2o.external.h2o.driver=/home/whiz/spark/h2odriver-3.36.1.3.jar" \
  --conf "spark.ext.h2o.external.cluster.size=2" \
  spark_h20/h2o_script.py

Error logs:

py4j.protocol.Py4JJavaError: An error occurred while calling o58.getOrCreate.
: java.io.IOException: Cannot run program "hadoop": error=2, No such file or directory

The automatic start of the Sparkling Water (SW) external backend is only supported in Hadoop or Kubernetes environments, which is why the auto start mode tries to run the "hadoop" command and fails when it is not installed. In a standalone deployment, you need to deploy the external backend manually, following the tutorial in the SW documentation.
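As a rough illustration of what switching to a manual start involves on the Spark side, the sketch below builds the `--conf` flags for attaching to an H2O cluster that is already running. The helper names (`manual_backend_conf`, `as_submit_flags`), the cluster name `sw-external`, and the address `10.0.0.5:54321` are hypothetical placeholders. The property names `spark.ext.h2o.cloud.name` and `spark.ext.h2o.cloud.representative` are given as I understand them from the Sparkling Water configuration reference and should be checked against the documentation for your version.

```python
import shlex


def manual_backend_conf(cluster_name, host, port):
    """Return Spark properties for attaching to an already-running external H2O cluster.

    Assumption: the property names match the Sparkling Water release in use.
    """
    return {
        "spark.ext.h2o.backend.cluster.mode": "external",
        # "manual" instead of "auto": Sparkling Water does not try to launch H2O via hadoop.
        "spark.ext.h2o.external.start.mode": "manual",
        # Must match the name the H2O nodes were started with.
        "spark.ext.h2o.cloud.name": cluster_name,
        # Address (ip:port) of any one node of the running H2O cluster.
        "spark.ext.h2o.cloud.representative": f"{host}:{port}",
    }


def as_submit_flags(conf):
    """Flatten a property dict into a list of spark-submit arguments."""
    flags = []
    for key, value in conf.items():
        flags += ["--conf", f"{key}={value}"]
    return flags


if __name__ == "__main__":
    # Hypothetical cluster name and node address, for illustration only.
    conf = manual_backend_conf("sw-external", "10.0.0.5", 54321)
    command = [
        "spark-submit",
        "--master", "spark://local:7077",
        *as_submit_flags(conf),
        "spark_h20/h2o_script.py",
    ]
    print(shlex.join(command))
```

The H2O nodes themselves still have to be started beforehand, as described in the SW documentation, and the `--py-files` argument from the original command is still needed. The script only prints the command so that it can be reviewed before running.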
