简体   繁体   English

使用模式文件创建Parquet支持的Hive表

[英]create a Parquet backed Hive table by using a schema file

Cloudera documentation , shows a simple way to " create a Avro backed Hive table by using an Avro schema file ." Cloudera 文档显示了一种“ 使用Avro架构文件创建Avro支持的Hive表 ”的简单方法。 This works great. 这非常有效。 I would like to do the same thing for a Parquet backed Hive table, but the relevant documentation in this case lists out every column type rather than reading from a schema. 我想为Parquet支持的Hive表做同样的事情,但本案例中的相关文档列出了每个列类型而不是从模式中读取。 Is it possible to read the Parquet columns from a schema, in the same way as Avro data? 是否可以从模式中读取Parquet列,与Avro数据相同?

Currently, the answer appears to be no . 目前,答案似乎是否定的 There is an open issue with Hive. Hive有一个未解决的问题。 https://issues.apache.org/jira/browse/PARQUET-76 https://issues.apache.org/jira/browse/PARQUET-76

The issue has been active recently, so hopefully in the near future Hive will offer the same functionality for Parquet as it does for Avro. 这个问题最近一直很活跃,所以希望在不久的将来,Hive将为Parquet提供与Avro相同的功能。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM