[英]external table with partitions in hive
I have a bunch of tsv files in HDFS in a directory structure that follows the partition convention where an event_dt
is the partition. 我在HDFS中的目录结构中有一堆tsv文件,该目录结构遵循分区约定,其中
event_dt
是分区。
some_path/event_dt=2017-04-30
some_path/event_dt=2017-05-01
and so on. 等等。
The issue is that event_dt is also one of the columns. 问题是event_dt也是列之一。 The second one in particular.
特别是第二个。 But I cannot specify so since
event_dt
cannot appear in the table schema and in the PARTITIONED BY
statement. 但是我无法指定,因为
event_dt
不能出现在表模式和PARTITIONED BY
语句中。 That triggers: 触发:
Column repeated in partitioning columns
Is there a way around this other than using different names. 除了使用不同的名称之外,还有其他方法吗? It is, after all, the same information.
毕竟,它是相同的信息。
3 options if you dont want to rename the column. 3个选项,如果您不想重命名列。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.