
Does Apache Flink cache jobs?

I noticed that the first time I run a new job, it takes around 20% longer than when I launch it again.

Does Flink cache some results and reuse them if a job is run multiple times? If so, how can I control this?

I would like to measure how long my tasks run, but each time I rerun them they finish faster than before.

If you are using stateful functions and have configured checkpoints or savepoints, your job can take some time to restore its state from a checkpoint.
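To see where the difference between runs comes from, it can help to time each run separately rather than compare two totals. The following is a minimal sketch in plain Java; `RunTimer` and `timeRuns` are hypothetical names, not part of Flink, and the workload in `main` is only a placeholder for whatever code submits your job and waits for it to finish.

```java
import java.util.ArrayList;
import java.util.List;

/** Times repeated runs of the same job separately (hypothetical helper, not a Flink API). */
final class RunTimer {

    /** Runs the job `runs` times and returns each run's wall-clock duration in milliseconds. */
    static List<Long> timeRuns(Runnable job, int runs) {
        List<Long> millis = new ArrayList<>();
        for (int i = 0; i < runs; i++) {
            long start = System.nanoTime();
            job.run();
            millis.add((System.nanoTime() - start) / 1_000_000);
        }
        return millis;
    }

    public static void main(String[] args) {
        // Placeholder workload (assumption); replace with the code that runs your job.
        Runnable job = () -> {
            long sum = 0;
            for (int i = 0; i < 50_000_000; i++) {
                sum += i;
            }
            if (sum < 0) {
                throw new IllegalStateException("unexpected overflow");
            }
        };
        List<Long> t = timeRuns(job, 5);
        System.out.println("first run: " + t.get(0) + " ms, later runs: " + t.subList(1, t.size()));
    }
}
```

Keeping the first run apart from the later ones makes it easier to tell a one-time startup or restore cost from the steady run time.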

In order to make state fault tolerant, Flink needs to checkpoint the state. Checkpoints allow Flink to recover state and positions in the streams to give the application the same semantics as a failure-free execution.
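The idea of recovering "state and positions in the streams" can be illustrated with a toy model. This is only a sketch in plain Java, not how Flink implements checkpointing: `CheckpointSketch`, `run`, `interval`, and `failAt` are hypothetical names, the "stream" is a fixed list, and the "state" is a running sum.

```java
import java.util.Arrays;
import java.util.List;

/** Toy model of checkpoint and restore; NOT Flink's actual mechanism. */
final class CheckpointSketch {

    /**
     * Sums the input, saving (position, sum) every `interval` elements.
     * If failAt >= 0, simulates one crash just before processing that index,
     * then restores the last saved state and position and replays from there.
     */
    static long run(List<Integer> input, int interval, int failAt) {
        int ckptPos = 0;   // stream position stored in the last checkpoint
        long ckptSum = 0;  // state stored in the last checkpoint
        int pos = 0;
        long sum = 0;
        boolean failed = false;
        while (pos < input.size()) {
            if (!failed && pos == failAt) {
                failed = true;
                // Work done since the last checkpoint is lost; roll back both values.
                pos = ckptPos;
                sum = ckptSum;
                continue;
            }
            sum += input.get(pos);
            pos++;
            if (pos % interval == 0) {
                ckptPos = pos;
                ckptSum = sum;
            }
        }
        return sum;
    }

    public static void main(String[] args) {
        List<Integer> input = Arrays.asList(1, 2, 3, 4, 5, 6, 7, 8, 9, 10);
        System.out.println(run(input, 3, -1)); // failure-free run
        System.out.println(run(input, 3, 5));  // crash before index 5, then restore
    }
}
```

Both calls print the same sum, because the state and the read position are rolled back together and the elements after the checkpoint are replayed. The replay is also why a job that restores from a checkpoint needs extra time before it catches up.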

More about checkpointing and monitoring checkpointing can be found in the Flink documentation.
