简体   繁体   English

Hive Metastore 最大数据库和表数

[英]Maximum number of databases and tables in Hive Metastore

Inspired by a formerly asked question: Maximum number of partitions in hive受以前提出的问题启发: hive 中的最大分区数

I would like to know if there is also maximum number for items below:我想知道以下项目是否也有最大数量:

  1. Maximum number of databases in a catalog (I assume Hive Metastore only has one catalog, which is "hive")一个目录中的最大数据库数(我假设 Hive Metastore 只有一个目录,即“hive”)
  2. Maximum number of tables per database (as in, can I create 10 million tables in a database or due to limitation must I split them into 10 databases each with 1 million tables)每个数据库的最大表数(例如,我可以在数据库中创建 1000 万个表还是由于限制必须将它们分成 10 个数据库,每个数据库有 100 万个表)

I also would like to know whether the limitations are hard limit (unconfigurable), or configurable by Hive, or dependent on RDBMS it is using.我还想知道这些限制是硬限制(不可配置),还是由 Hive 配置,或者取决于它使用的 RDBMS。

UPDATE: Crossposted to Cloudera Community after 7 days of inactivity.更新:在闲置 7 天后交叉发布到Cloudera 社区 I will be active on both places.我将在这两个地方都很活跃。

Answered in Cloudera Community by ChethanYM ChethanYMCloudera 社区中回答

Key Summary:关键摘要:

  • No hard limits on database and tables对数据库和表没有硬性限制
  • Expect performance issues if total number of objects exceeds 500,000 (defined as "objects like tables and partitions in the warehouse including column and column type information etc...")如果对象总数超过 500,000(定义为“仓库中的表和分区等对象,包括列和列类型信息等......”),则预计会出现性能问题

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM