简体繁体中英

What are the major differences between S3 lake formation governed tables and databricks delta tables?

原文 2021-12-06 12:01:54 2 2 amazon-s3/ databricks/ delta-lake/ aws-lake-formation

What are the major differences between S3 lake formation governed tables and databricks delta tables? they look pretty similar.

2 answers

Governed tables, Delta Lake, and to some extent also Apache Iceberg and Hudi are all tabular data formats. So instead of storing data in raw formats (parquet, orc, avro) they all have an additional manifest files which provides metadata about which files are present in a table during a certain state. This allows them all to enable features like ACID transactions, time-travel, and snapshotting.

The main difference right now is which big data tools they can integrate with.

AWS Governed tables has tight integration with all of AWS. It can easily leverage the Lake Formation permission model to govern access of data catalog objects (database, table, and column). It also allows you to use AWS query engines: Redshift Spectrum and Athena. Spark is not yet supported.

Delta Lakes provides ACID transactions, time traveling, and snapshotting on Spark. It also supports Spark streaming and data mutation.

What would then be the difference between Glue tables and Governed tables and also with the Hudi, Iceberg and Delta Lake?

Glue tables allow also to query S3 parquet files from Athena, Redshift Spectrum, Glue and from a Spark job.

Writing delta lake to AWS S3 (Without Databricks)

AWS Lake Formation: Insufficient Lake Formation permission(s) on s3://abc/

Creating hive tables in S3 bucket using databricks

What is the difference between a data lake with HDFS or S3 in AWS?

Correct Method to Delete Delta Lake Partion on AWS s3

How to fix corrupted delta lake table on AWS S3

AWS GLUE Not able to write Delta lake in s3

Copy delta data from AWS S3 to Azure Data Lake Storage Gen2 failed

Delta Lake (OSS) Table on EMR and S3 - Vacuum takes a long time with no jobs

java.lang.NoClassDefFoundError: org/apache/spark/sql/catalyst/plans/logical/AnalysisHelper while writing delta-lake into s3 storage

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Writing delta lake to AWS S3 (Without Databricks) AWS Lake Formation: Insufficient Lake Formation permission(s) on s3://abc/ Creating hive tables in S3 bucket using databricks What is the difference between a data lake with HDFS or S3 in AWS? Correct Method to Delete Delta Lake Partion on AWS s3 How to fix corrupted delta lake table on AWS S3 AWS GLUE Not able to write Delta lake in s3 Copy delta data from AWS S3 to Azure Data Lake Storage Gen2 failed Delta Lake (OSS) Table on EMR and S3 - Vacuum takes a long time with no jobs java.lang.NoClassDefFoundError: org/apache/spark/sql/catalyst/plans/logical/AnalysisHelper while writing delta-lake into s3 storage

Related Tags

What are the major differences between S3 lake formation governed tables and databricks delta tables?

Question

2 answers

solution1
0 2021-12-09 02:18:14

solution2
0 2022-08-07 16:13:02

What are the major differences between S3 lake formation governed tables and databricks delta tables?

Question

2 answers

solution1 0 2021-12-09 02:18:14

solution2 0 2022-08-07 16:13:02

solution1
0 2021-12-09 02:18:14

solution2
0 2022-08-07 16:13:02