简体   繁体   English

AWS Glue Catalog作业上的MSCK修复命令

[英]MSCK Repair Command on AWS Glue Catalog job

Can we have an AWS Glue job scheduled to perform MSCK repair commands so that the metadata for newly added partition gets added to Glue Catalog? 我们可以安排一个AWS Glue作业执行MSCK修复命令,以便将新添加的分区的元数据添加到Glue目录中吗?

Can Glue ETL script perform MSCK REPAIR TABLE command without calling Athena? Glue ETL脚本可以在不调用Athena的情况下执行MSCK REPAIR TABLE命令吗?

This is achieved by Glue Crawlers. 这是通过胶粘剂爬行者实现的。 If you create a crawler it will update the table based on new fields and add new partitions. 如果创建搜寻器,它将基于新字段更新表并添加新分区。

You can call batch_create_partition() API to do it. 您可以调用batch_create_partition()API来执行此操作。 It doesn't require expensive operations like MSCK REPAIR TABLE or re-crawling. 它不需要像MSCK REPAIR TABLE或重新爬网这样的昂贵操作。 Below is my detailed answer with code sample - 以下是我的详细答案和代码示例-

https://stackoverflow.com/a/52239022/2414855 https://stackoverflow.com/a/52239022/2414855

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 AWS更新Athena meta:胶履带vs MSCK维修表 - AWS update Athena meta: Glue Crawler vs MSCK Repair Table AWS 胶水作业 (Pyspark) 到 AWS 胶水数据目录 - AWS glue job (Pyspark) to AWS glue data catalog AWS Glue Job - 将粘合目录表名称作为参数传递 - AWS Glue Job - pass glue catalog table names as parameters 删除数百个分区后,msck 修复表命令在 AWS Athena 中运行缓慢 - msck repair table command runs slow in AWS Athena after deleting few hundred partitions AWS Glue Crawler 在没有 Glue Job 的情况下将所有数据发送到 Glue Catalog 和 Athena - AWS Glue Crawler sends all data to Glue Catalog and Athena without Glue Job AWS Glue Python 作业未创建新的数据目录分区 - AWS Glue Python Job not creating new Data Catalog partitions 如何在 AWS Athena 中自动执行 MSCK REPAIR TABLE - How to make MSCK REPAIR TABLE execute automatically in AWS Athena AWS Athena MSCK REPAIR TABLE 对于小数据集花费的时间太长 - AWS Athena MSCK REPAIR TABLE takes too long for a small dataset 在 AWS Athena 中运行多个 MSCK REPAIR TABLE 语句 - Running multiple MSCK REPAIR TABLE statements in AWS Athena MSCK 修复表命令仅在重新创建表后才有效 - MSCK Repair table command works only after the table is recreated
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM