
How to deploy modified airflow dag from a different start time?

Let's say the scheduler is stopped for 5 hours and I have a DAG scheduled to run twice every hour. When I restart the scheduler, I do not want Airflow to backfill all the runs that were missed; instead, I want it to continue from the current hour.

To achieve this behavior, you can add the LatestOnlyOperator, which was just recently introduced to master, to the start of your DAG. Note that it is not part of a released version as of the writing of this post.
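To make the idea concrete, here is a minimal plain-Python sketch of the kind of time-window check a "latest only" task performs, as I understand it. It does not import Airflow; the function name `is_latest_run`, the dates, and the fixed 30-minute interval are all assumptions chosen to match the question. The missed runs are still created after the restart, but only the one whose window contains the current time lets its downstream tasks run.

```python
from datetime import datetime, timedelta


def is_latest_run(execution_date, schedule_interval, now):
    """Return True if `now` falls inside the window where this run is the latest.

    Hypothetical helper for illustration; not an Airflow API.
    """
    # A run for `execution_date` becomes due once its own interval has ended.
    left_window = execution_date + schedule_interval
    # It stops being the latest run once the following interval has ended too.
    right_window = left_window + schedule_interval
    return left_window < now <= right_window


# Assumed scenario: a DAG running twice an hour, scheduler down 07:00-12:00,
# restarted at 12:10.
interval = timedelta(minutes=30)
now = datetime(2017, 1, 1, 12, 10)

# The ten runs the scheduler would catch up on after the restart
# (execution dates 07:00, 07:30, ..., 11:30).
missed_runs = [datetime(2017, 1, 1, 7, 0) + i * interval for i in range(10)]

# Only the most recent run passes the check; the rest would skip downstream work.
to_execute = [d for d in missed_runs if is_latest_run(d, interval, now)]
print(to_execute)
```

In a real DAG, the operator would sit upstream of the tasks you want skipped for stale runs, so every other task should depend on it directly or indirectly.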

I'm sure you're no longer waiting for an answer, but for reference, this is covered here: https://cwiki.apache.org/confluence/display/AIRFLOW/Common+Pitfalls .

"When needing to change your start_date and schedule interval, change the name of the dag (aka dag_id) - I follow the convention : my_dag_v1, my_dag_v2, my_dag_v3, my_dag_v4, etc..."
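The naming convention in that quote can be sketched as a small helper that bumps the version suffix whenever the start_date or schedule interval changes. The function `next_dag_id` is hypothetical, and treating an id without a suffix as version 1 is my own assumption rather than anything the wiki page prescribes.

```python
import re


def next_dag_id(dag_id):
    """Return the dag_id with its trailing _vN suffix incremented.

    Hypothetical helper for illustration; not part of Airflow.
    """
    match = re.fullmatch(r"(.+)_v(\d+)", dag_id)
    if match is None:
        # Assumption: an id without a suffix counts as version 1.
        return f"{dag_id}_v2"
    base, version = match.groups()
    return f"{base}_v{int(version) + 1}"


print(next_dag_id("my_dag_v1"))
```

Because the scheduler sees the renamed DAG as a new one, it has no earlier run history for it and starts from the new start_date.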

