[英]Databricks is "Updating the Delta table's state"
I'm reading and joining multiple delta tables from a Datalake and store the result back to another Deltalake location.我正在从 Datalake 读取并加入多个增量表,并将结果存储回另一个 Deltalake 位置。 When doing so, Databricks is showing me :
这样做时,Databricks 向我展示:
Depending on how many delta tables I join with each other, this can take up to very long time.根据我相互连接的增量表的数量,这可能需要很长时间。 Even tough the joining itself would just take up to a few minutes, the state update takes up to an hour.
即使加入本身也只需要几分钟,状态更新需要长达一个小时。
What is happening when I see Updating the Delta table's state
?当我看到
Updating the Delta table's state
什么? Can I somehow optimize this?我可以以某种方式优化它吗?
Thank you Karthikeyan Rasipalay Durairaj , Posting your suggestion as an answer to help other community members.谢谢Karthikeyan Rasipalay Durairaj ,发布您的建议作为帮助其他社区成员的答案。
Updating the Delta table's state.
更新 Delta 表的状态。
The command status report means ,命令状态报告意味着,
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.