
Running a Spark application from an HDInsight cluster headnode

Original post: 2017-03-27 13:25:13. Tags: azure, apache-spark, hdinsight, azure-data-factory, apache-spark-2.0

I am trying to run a Spark Scala application from the headnode of an Azure HDInsight cluster with this command:

spark-submit --class com.test.spark.Wordcount SparkJob1.jar wasbs://containername@<storageaccountname>/sample.sas7bdat wasbs://containername@<storageaccountname>/sample.csv

It fails with the following exception:

Caused by: java.lang.ClassCastException: cannot assign instance of scala.collection.immutable.List$SerializationProxy to field org.apache.spark.rdd.RDD.org$apache$spark$rdd$RDD$$dependencies_ of type scala.collection.Seq in instance of org.apache.spark.rdd.MapPartitionsRDD

The same jar file works when I invoke it from Azure Data Factory. Am I missing some configuration for the spark-submit command?

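1 Answer

Usually, this is caused by the type-conversion logic in your code. A similar SO thread, "How to fix java.lang.ClassCastException: cannot assign instance of scala.collection.immutable.List to field type scala.collection.Seq?", has already been answered. I think you can refer to it and check your code to resolve the issue.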
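Besides reviewing the code, it may be worth ruling out a difference in how the job is launched, since the same jar works from Azure Data Factory. This exception is often reported when the driver and the executors do not load the application classes the same way, for example when the master is set inconsistently between the code and the command line. The sketch below is only an assumption to test, not a confirmed fix: it passes `--master yarn` and `--deploy-mode cluster` explicitly (both are standard spark-submit options), and `RUN_JOB` is a hypothetical variable introduced here so the command can be printed and reviewed before it is run. The paths are copied from the question with their placeholders left unfilled.

```shell
#!/bin/sh
# Sketch only: the --master/--deploy-mode values are assumptions to try,
# not a verified fix for the ClassCastException in the question.
set -eu

# Paths copied from the question; the placeholders are intentionally unfilled.
INPUT='wasbs://containername@<storageaccountname>/sample.sas7bdat'
OUTPUT='wasbs://containername@<storageaccountname>/sample.csv'

# Store the command in the positional parameters so each argument stays one word.
set -- spark-submit \
  --master yarn \
  --deploy-mode cluster \
  --class com.test.spark.Wordcount \
  SparkJob1.jar \
  "$INPUT" \
  "$OUTPUT"

# Dry run: keep and print the assembled command for review.
SUBMIT_CMD="$*"
echo "$SUBMIT_CMD"

# Execute only when explicitly requested (RUN_JOB is a hypothetical switch).
if [ "${RUN_JOB:-0}" = "1" ]; then
  "$@"
fi
```

If the job succeeds with these flags but not without them, check whether the application code calls `setMaster` or `.master(...)` with a hardcoded value, because a master set in code takes precedence over the one passed to spark-submit.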

Related questions

Error while running Zeppelin paragraphs in Spark on Linux cluster in Azure HDInsight

Can't access logs from Spark UI on HDInsight cluster

List allowed values for headnode and workernode while creating HDInsight cluster using Resource Manager template

Failure recovery in Spark running on HDInsight

Remotely execute a Spark job on an HDInsight cluster

How to access a table from a Hive cluster located in HDInsight from a local Spark server built on IntelliJ

How to get cluster details like clusterID from code running on an HDInsight cluster

Unable to access web UI of Oozie from Ambari in HDI 3.6 Spark HDInsight cluster

Accessing Azure Table storage from Map/Reduce job running in an HDInsight cluster

How to read Azure Table Storage data from Apache Spark running on HDInsight


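Notice: technical posts on this site are published under the CC BY-SA 4.0 license. If you reproduce this content, please cite this site's URL or the original address. For any questions, contact: yoyou2525@163.com.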
