After starting DFS, YARN, and Spark, I ran this command from the Spark root directory on the master host:
MASTER=yarn ./bin/run-example ml.LogisticRegressionExample data/mllib/sample_libsvm_data.txt
I took this command from Spark's README; here is the source of LogisticRegressionExample on GitHub: https://github.com/apache/spark/blob/master/examples/src/main/scala/org/apache/spark/examples/ml/LogisticRegressionExample.scala
Then this error occurs:
Exception in thread "main" org.apache.spark.sql.AnalysisException: Path does not exist: hdfs://master:9000/user/root/data/mllib/sample_libsvm_data.txt;
First, I don't understand why the path is hdfs://master:9000/user/root. I did set the namenode address to hdfs://master:9000, but why did Spark choose /user/root?
Then I created the directory /user/root/data/mllib/sample_libsvm_data.txt on every host of the cluster, hoping Spark could find the file there, but the same error occurred. Please tell me how to fix it.
Spark is looking for the file on HDFS, not on the local Linux file system. The path you passed (data/mllib/sample_libsvm_data.txt) is a relative path, and in HDFS relative paths are resolved against your HDFS home directory, which is /user/&lt;username&gt; by default; since you are running as root, that is /user/root.
The LogisticRegressionExample.scala on GitHub assumes local execution, not execution on YARN. If you want to run it on YARN, you need to upload the input file to HDFS first.
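Concretely, the fix looks like the following. This is a sketch, assuming your namenode is at hdfs://master:9000 (as in your config), you run the commands as root from the Spark root directory, and the sample file exists locally at data/mllib/sample_libsvm_data.txt:

```shell
#!/usr/bin/env bash
set -e

# Create the matching directory layout in HDFS (not on the local disks):
hdfs dfs -mkdir -p /user/root/data/mllib

# Copy the local sample file into HDFS; -f overwrites if it already exists:
hdfs dfs -put -f data/mllib/sample_libsvm_data.txt /user/root/data/mllib/

# Verify the file is where Spark will look for it:
hdfs dfs -ls /user/root/data/mllib/sample_libsvm_data.txt

# Now the original command works, because the relative path resolves
# to hdfs://master:9000/user/root/data/mllib/sample_libsvm_data.txt:
MASTER=yarn ./bin/run-example ml.LogisticRegressionExample \
  data/mllib/sample_libsvm_data.txt
```

You only need to upload the file once, from any host that can reach the namenode; HDFS makes it visible to every executor, so there is no need to copy anything to each machine.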