简体   繁体   English

将S3用于HIVE元存储库?

[英]Use S3 for HIVE metastore?

I'm configuring S3 to be a data warehouse for Hive. 我将S3配置为Hive的数据仓库。 However, I'm not sure if the 'metastore' part of Hive can use S3 as a storage or not. 但是,我不确定Hive的“元存储”部分是否可以使用S3作为存储。

Most tutorials (ie https://data-flair.training/blogs/apache-hive-metastore/ ) mention that the metastore only supports Derby and a few other RDBMS. 大多数教程(即https://data-flair.training/blogs/apache-hive-metastore/ )都提到元存储仅支持Derby和其他一些RDBMS。

Is that true? 真的吗? Configuring metastore to use s3 as storage sounds like a simple config change to me. 配置metastore以将s3用作存储听起来对我来说是一个简单的配置更改。

The Hive Metastore persists the metadata for Hive tables and partitions. Hive Metastore保留Hive表和分区的元数据。 It is backed by a relational database, and currently MySQL, Postgres, Oracle, and MS SQL Server are supported. 它由关系数据库支持,目前支持MySQL,Postgres,Oracle和MS SQL Server S3 is an object store , not a relational database. S3是对象存储 ,而不是关系数据库。

This is the ER diagram showing the various tables, columns and constraints stored in the Hive Metastore. 这是ER图,显示了Hive Metastore中存储的各种表,列和约束。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM