简体   繁体   English

非分区表对蜂巢的优势?

[英]Advantages of non-partitioned table on hive?

Are there any advantages of non-partitioned table on Hive -- their special use cases comparing to partitioned table? Hive上的非分区表有什么优势-与分区表相比,它们的特殊用例?

It will be great if anyone could help. 如果有人可以帮助,那就太好了。 :) :)

Let's put it that way: in the database world, partitioning can be used to solve different kinds of problems. 这么说吧:在数据库世界中,分区可用于解决各种问题。 As long as you have no explicit problem, don't bother with partitions (ie "if it ain't broken, don't fix it") . 只要您没有明显的问题,就不要理会分区(即“如果它没有损坏,请不要修复它”) Whenever you hit a problem, ask a DB architect to find a solution - may involve partitioning, maybe not. 每当您遇到问题时,都要求数据库架构师找到解决方案-可能涉及分区,也许不涉及。

But Hive is not a typical database. 但是Hive不是典型的数据库。 Partitions are everywhere, just because it's a crude workaround for the lack of indexes... 分区无处不在,只是因为缺少索引这是一种粗略的解决方法...

(Well, actually the ORC format has its own workaround [stores min/max values per column per stripe, which allows skipping useless stripes] so partitioning is less critical with that format) (好吧,实际上,ORC格式有其自己的解决方法[在每个条带的每一列中存储最小值/最大值,这允许跳过无用的条带],因此分区对该格式的要求不高)

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何在Hive上对未分区的表进行分区? - How to partition a non-partitioned table on Hive? 查询整个 Hive 外部表上的非分区列 - Query non-partitioned column on Entire Hive external Table SPARK 1.6插入现有的Hive表(未分区) - SPARK 1.6 Insert into existing Hive table (non-partitioned) PySpark 数据从非分区 hive 表加载到分区 hive 表的性能调整 - Performance tuning for PySpark data load from a non-partitioned hive table to a partitioned hive table 外部非分区 Hive 表显示 0 条记录,即使 parquet 文件有数据,当使用 spark.read.parquet 读取时,它显示正确的数据 - External non-partitioned Hive table shows 0 records even if parquet file has data, when read using spark.read.parquet it shows correct data 我可以将已加载数据的非分区表更改为具有动态分区吗? - Can i alter non-Partitioned table with already loaded data to have dynamic partitions? 从非分区表创建分区的配置单元表 - Creating a partitioned hive table from a non partitioned table 插入Hive表-非分区表到分区表-无法插入目标表,因为列号/类型 - Inserting into Hive table - Non Partitioned table to Partitioned table - Cannot insert into target table because column number/types Hive在分区表中加载 - Hive loading in partitioned table 创建 Hive 分区表 - Create Hive Partitioned Table
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM