
Oozie Spark action: table not found

I am trying to set up a Spark action workflow in Apache Oozie, but I get the following error when select * from db.table is called from my Spark code in a Hive context:

org.apache.spark.sql.AnalysisException: Table not found: `db`.`table`; line 1 pos 34

The Spark job works when launched with spark-submit, so I can't pin down the issue. I've added hive-site.xml to the locations recommended in previous questions, such as the workspace lib directory and the workspace directory, and referenced it in the job.xml setting, but I still get the same error.

I'm running with master yarn and deploy mode cluster.

I've tried many combinations and I'm not sure what else to do.

Where am I going wrong?

The Hive configuration needs to be added to the Spark action. For example, add a file element to the action in the workflow definition file:

<spark xmlns="uri:oozie:spark-action:1.0">
   <!-- ... -->
   <file>${hiveConfig}</file>
</spark>

job.properties must define the referenced property:

hiveConfig=/user/oozie/extraconfig/hive-site.xml

This file must be available on every node of the cluster.
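Before shipping hive-site.xml with the action, it can help to confirm that the copy being referenced actually points at the shared metastore. A likely cause of this error (an assumption, not something confirmed in the question) is that Spark does not see the file and falls back to a local embedded metastore that has no db.table. The following is a minimal Python sketch of such a check. The helper name read_hadoop_property and the sample host are hypothetical; hive.metastore.uris is the standard Hive property for the remote metastore address.

```python
import xml.etree.ElementTree as ET


def read_hadoop_property(xml_text, name):
    """Return the value of a <property> entry in Hadoop-style
    configuration XML, or None if the property is absent."""
    root = ET.fromstring(xml_text)
    for prop in root.iter("property"):
        if prop.findtext("name", default="").strip() == name:
            value = prop.findtext("value")
            return value.strip() if value is not None else None
    return None


# Hypothetical hive-site.xml content; host and port are placeholders.
SAMPLE_HIVE_SITE = """<?xml version="1.0"?>
<configuration>
  <property>
    <name>hive.metastore.uris</name>
    <value>thrift://metastore-host.example.com:9083</value>
  </property>
</configuration>
"""

uris = read_hadoop_property(SAMPLE_HIVE_SITE, "hive.metastore.uris")
print(uris)
```

If the lookup returns None for the real file, the copy being distributed probably does not configure a remote metastore, which would be consistent with the table not being found.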
