
Cannot query struct field with Hive (CDH 5.9.0)

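I just switched to CDH 5.9.0 (a fresh install on a new cluster, not an upgrade). I have a table like this one (the real table is a bit more complex, but this example reproduces the problem too):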

CREATE TABLE `products`(`header` struct<PCODE:string, PNAME:string>)
PARTITIONED BY (`IMPORT_DATE` string)
ROW FORMAT SERDE 
  'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe' 
STORED AS INPUTFORMAT 
  'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat'
OUTPUTFORMAT 
  'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'
LOCATION
  'hdfs://myhost.com:8020/user/hive/warehouse/dbp/products'
TBLPROPERTIES ('transient_lastDdlTime'='1482160314')

If I run:

SELECT header FROM products;

==> the query succeeds and returns all product headers (in JSON format).

But if I run:

SELECT header.PCODE FROM products;

==> it fails with the following stack trace:

Error: java.lang.RuntimeException: Error in configuring object
                at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
                at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
                at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
                at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:449)
                at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343)
                at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:164)
                at java.security.AccessController.doPrivileged(Native Method)
                at javax.security.auth.Subject.doAs(Subject.java:422)
                at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1698)
                at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)
Caused by: java.lang.reflect.InvocationTargetException
                at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
                at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
                at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
                at java.lang.reflect.Method.invoke(Method.java:498)
                at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
                ... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
                at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
                at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
                at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
                at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
                ... 14 more
Caused by: java.lang.reflect.InvocationTargetException
                at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
                at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
                at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
                at java.lang.reflect.Method.invoke(Method.java:498)
                at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
                ... 17 more
Caused by: java.lang.RuntimeException: Map operator initialization failed
                at org.apache.hadoop.hive.ql.exec.mr.ExecMapper.configure(ExecMapper.java:147)
                ... 22 more
Caused by: java.lang.NullPointerException
                at org.apache.hadoop.hive.ql.exec.ExprNodeFieldEvaluator.initialize(ExprNodeFieldEvaluator.java:61)
                at org.apache.hadoop.hive.ql.exec.ExprNodeFieldEvaluator.initialize(ExprNodeFieldEvaluator.java:53)
                at org.apache.hadoop.hive.ql.exec.Operator.initEvaluators(Operator.java:954)
                at org.apache.hadoop.hive.ql.exec.Operator.initEvaluatorsAndReturnStruct(Operator.java:980)
                at org.apache.hadoop.hive.ql.exec.SelectOperator.initializeOp(SelectOperator.java:63)
                at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:385)
                at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:469)
                at org.apache.hadoop.hive.ql.exec.Operator.initializeChildren(Operator.java:425)
                at org.apache.hadoop.hive.ql.exec.TableScanOperator.initializeOp(TableScanOperator.java:193)
                at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:385)
                at org.apache.hadoop.hive.ql.exec.MapOperator.initializeOp(MapOperator.java:431)
                at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:385)
                at org.apache.hadoop.hive.ql.exec.mr.ExecMapper.configure(ExecMapper.java:126)
                ... 22 more

On my old cluster (CDH 5.8.2), the same query works fine. Any ideas?

[EDIT: I downgraded all the CDH 5.9.0 jars (/opt/cloudera/parcels/CDH/jars) to CDH 5.8.2, and the query succeeds. There may be a regression in CDH 5.9.0...]

[EDIT 2: If the table is stored as TextFile ('org.apache.hadoop.mapred.TextInputFormat'), the query runs successfully. So the problem appears to be related to Parquet.]

[Also posted on the Cloudera forum: https://community.cloudera.com/t5/Batch-SQL-Apache-Hive/Can-not-query-struct-field-with-hive-CDH-5-9-0/mp/48672#U48672 ]

I would work around this problem by writing the query elements in lowercase. For example:

SELECT header.pcode FROM products;

So I tried a number of things and ended up with the following results:

-- Struct fieldnames in lowercase
CREATE TABLE `products`(`header` struct<pcode:string, pname:string>) STORED AS PARQUET;

SELECT results:

  • SELECT header.pcode FROM products ==> OK
  • SELECT HEADER.pcode FROM products ==> OK
  • SELECT header.PCODE FROM products ==> KO
  • SELECT HEADER.PCODE FROM products ==> KO

-- Struct fieldnames in UPPERCASE
CREATE TABLE `products`(`header` struct<PCODE:string, PNAME:string>) STORED AS PARQUET;

SELECT results:

  • SELECT header.pcode FROM products ==> KO
  • SELECT HEADER.pcode FROM products ==> KO
  • SELECT header.PCODE FROM products ==> KO
  • SELECT HEADER.PCODE FROM products ==> KO

==> Avoid uppercase in struct field names when the table is stored as PARQUET in CDH 5.9.0 (uppercase names worked in CDH 5.8.2)...
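
If many table definitions need to follow this rule, the renaming can be done mechanically on the DDL text. The following is a minimal Python sketch, not part of Hive or CDH: the helper name `lowercase_struct_fields` is hypothetical, and it assumes unquoted field names of the form `NAME:type`. It only rewrites the type string; whether existing Parquet files stay readable after such a change is not something the tests above cover.

```python
import re

# A struct field name is an identifier that directly follows '<' or ','
# (optionally after whitespace) and is followed by ':'.
# Type names such as 'string' follow ':' and are therefore left untouched.
_FIELD_NAME = re.compile(r"(?<=[<,])(\s*)([A-Za-z_][A-Za-z0-9_]*)(\s*:)")


def lowercase_struct_fields(hive_type: str) -> str:
    """Return hive_type with every struct field name lowercased.

    Assumption: field names are not backtick-quoted.
    """
    return _FIELD_NAME.sub(
        lambda m: m.group(1) + m.group(2).lower() + m.group(3),
        hive_type,
    )


if __name__ == "__main__":
    print(lowercase_struct_fields("struct<PCODE:string, PNAME:string>"))
```

Applied to the type from the question, this produces the lowercase struct definition used in the first test above, which can then be pasted into the CREATE TABLE statement.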
