Oozie Spark action: table not found
I am trying to set up a Spark action workflow in Apache Oozie, but I get the following error when
select * from db.table
is called within my Spark code in a Hive context:
org.apache.spark.sql.AnalysisException: Table not found: `db`.`table`; line 1 pos 34
This Spark job works with spark-submit, so I can't nail down the issue. I've added hive-site.xml to the locations recommended in previous questions, such as the workspace lib directory and the workspace directory, and added it to the job.xml setting, but I still get the same error.
I'm running with deploy mode cluster and master yarn.
I've tried many combinations and I'm not sure what else to do.
Where am I going wrong?
You need to add the Hive configuration to the action. For example, add a <file> element that points to hive-site.xml inside the Spark action of the workflow file:
<spark xmlns="uri:oozie:spark-action:1.0">
<!-- ... -->
<file>${hiveConfig}</file>
</spark>
job.properties must define the property that the action references:
hiveConfig=/user/oozie/extraconfig/hive-site.xml
This file must be available on every node of the cluster.
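For context, here is a rough sketch of how the <file> element might sit inside a complete workflow.xml. Only the <file> line, the hiveConfig property, and the yarn/cluster settings come from this thread. The workflow name, node names, class name, jar path, and the resourceManager and nameNode properties are placeholders I made up for illustration, so adapt them to your own job.

```xml
<!-- Sketch only: names, class, jar path, and the resourceManager/nameNode
     properties are hypothetical placeholders, not taken from the question. -->
<workflow-app xmlns="uri:oozie:workflow:1.0" name="spark-hive-example">
    <start to="spark-node"/>
    <action name="spark-node">
        <spark xmlns="uri:oozie:spark-action:1.0">
            <resource-manager>${resourceManager}</resource-manager>
            <name-node>${nameNode}</name-node>
            <!-- master and mode as described in the question -->
            <master>yarn</master>
            <mode>cluster</mode>
            <name>SparkHiveExample</name>
            <class>com.example.SparkHiveJob</class>
            <jar>${nameNode}/user/oozie/apps/spark-hive/lib/spark-hive-job.jar</jar>
            <!-- Ships hive-site.xml with the job; hiveConfig is set in job.properties -->
            <file>${hiveConfig}</file>
        </spark>
        <ok to="end"/>
        <error to="fail"/>
    </action>
    <kill name="fail">
        <message>Spark action failed: ${wf:errorMessage(wf:lastErrorNode())}</message>
    </kill>
    <end name="end"/>
</workflow-app>
```

In the spark-action schema the <file> element comes after <jar>, <spark-opts>, and <arg>, so it belongs near the end of the <spark> block. If the action still reports the table as missing, it may be worth checking the launcher logs to confirm that hive-site.xml actually landed in the container's working directory.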