简体   繁体   English

如何找到已完成的 SLURM 作业的排队时间?

[英]How do I find the queuing time for completed SLURM jobs?

I am trying to find a way to see the waiting time of completed jobs until they started on a cluster with SLURM as the Workload manager.我试图找到一种方法来查看已完成作业的等待时间,直到它们在以 SLURM 作为工作负载管理器的集群上启动。

I know the "sacct" command can print a lot of information about completed jobs but I don't find anything useful for my task.我知道“sacct”命令可以打印很多关于已完成作业的信息,但我没有发现任何对我的任务有用的信息。

Perfect would be either:完美将是:

  • The time the job was queuing until it started作业在开始之前排队的时间
  • A timestamp or date of the time I submitted the job (then I can calculate back the waiting time, because "sacct" gives information about the time the job started with the "Start" field)我提交作业的时间戳或日期(然后我可以计算等待时间,因为“sacct”通过“开始”字段提供有关作业开始时间的信息)

Do you have any useful tips?你有什么有用的提示吗?

The sacct command has that: sacct -o start,submit should give all the info you need. sacct命令具有: sacct -o start,submit应该提供您需要的所有信息。

The queue time for a job can be obtained from the sacct command like this:可以从sacct命令获取作业的排队时间,如下所示:

sacct -X  -j <jobid> -o Reserved

The Reserved column is, (from the sacct manpage ) Reserved列是,(来自sacct 手册页

Reserved How much wall clock time was used as reserved time for this job.保留 多少挂钟时间用作此作业的保留时间。 This is derived from how long a job was waiting from eligible time to when it actually started.这是根据作业从符合条件的时间到实际开始的等待时间得出的。 Format is the same as Elapsed.格式与 Elapsed 相同。

Note this has nothing to do with so-called "advanced resource reservations" in Slurm.请注意,这与 Slurm 中所谓的“高级资源预留”无关。

Basically the time a job is "submitted" and that job is "eligible" is most often the same, except in some cases, for instance when the job is submitted with --hold , or with --dependency .基本上,作业“提交”和该作业“合格”的时间通常是相同的,除了在某些情况下,例如当作业通过--hold--dependency提交时。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM