I'm reading and joining multiple delta tables from a Datalake and store the result back to another Deltalake location. When doing so, Databricks is showing me :
Depending on how many delta tables I join with each other, this can take up to very long time. Even tough the joining itself would just take up to a few minutes, the state update takes up to an hour.
What is happening when I see Updating the Delta table's state
? Can I somehow optimize this?
Thank you Karthikeyan Rasipalay Durairaj , Posting your suggestion as an answer to help other community members.
Updating the Delta table's state.
The command status report means ,
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.