
Does Apache Flink cache jobs?

I noticed that the first run of a new job takes around 20% longer than when I launch the same job again.

Does Flink cache some results and reuse them if a job is run multiple times? If so, how can I control this?

I would like to measure how long my tasks run, but each time I rerun them they are faster than before.
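One way to make such measurements comparable is to time several consecutive runs and report the first run separately from the later ones. The following is only a minimal sketch in plain Java, not Flink API: `RunTimer`, `timeRuns`, and `steadyStateAverage` are hypothetical names, and the `Runnable` is a placeholder that I assume would be replaced with code that submits the job and blocks until it finishes.

```java
import java.util.ArrayList;
import java.util.List;

public class RunTimer {

    /** Runs the task `runs` times and returns each run's wall-clock duration in milliseconds. */
    public static List<Long> timeRuns(Runnable task, int runs) {
        List<Long> durations = new ArrayList<>();
        for (int i = 0; i < runs; i++) {
            long start = System.nanoTime();
            task.run();
            durations.add((System.nanoTime() - start) / 1_000_000);
        }
        return durations;
    }

    /** Average duration after skipping the first `warmup` runs (NaN if nothing is left). */
    public static double steadyStateAverage(List<Long> durations, int warmup) {
        return durations.stream()
                .skip(warmup)
                .mapToLong(Long::longValue)
                .average()
                .orElse(Double.NaN);
    }

    public static void main(String[] args) {
        // Placeholder workload (assumption): replace with code that launches
        // the job and waits for it to complete.
        Runnable job = () -> {
            long sum = 0;
            for (int i = 0; i < 5_000_000; i++) {
                sum += i % 7;
            }
            if (sum < 0) {
                throw new IllegalStateException("unreachable");
            }
        };

        List<Long> durations = timeRuns(job, 5);
        System.out.println("per-run ms: " + durations);
        System.out.println("average after 1 warm-up run: " + steadyStateAverage(durations, 1));
    }
}
```

Reporting every run, rather than a single number, makes it visible whether only the first launch is slow or whether the times keep drifting.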

If you are using stateful functions and have configured checkpoints and savepoints, your job can take some time to restore its state from a checkpoint.

In order to make state fault tolerant, Flink needs to checkpoint the state. Checkpoints allow Flink to recover state and positions in the streams to give the application the same semantics as a failure-free execution.

More on checkpointing, including monitoring checkpointing, is covered in the Flink documentation.
