简体   繁体   中英

How to get the name of failed flink jobs

Our flink cluster sometimes restarts and all jobs will be restarted. Occasionally, some job failed to restart and failed count increases on the panel. However, it cannot let us know which jobs failed.

When total job count grows, it becomes harder to find out the stopped job. Does anyone know how can I get the names of the failed jobs? 在此处输入图片说明

You could write a simple script for that which will give you the list of job names which have failed.

I am using this command to get a list of failed job.

$yarn application -list -appStates KILLED

Set up alert when your cluster restarts and post restart check the jobs that haven't restarted and you could have alerts for those as well.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM