简体   繁体   English

在Slurm群集上运行批处理作业

[英]Running Batch Job on Slurm Cluster

So I have spent a few hours now trying to figure this out and would appreciate any help. 因此,我现在花了几个小时试图解决这个问题,希望能对您有所帮助。

What I am trying to do is run a batch job with a slurm --array0-654 我想做的是使用slurm --array0-654运行批处理作业

I would like each job step to run 8 threads. 我希望每个工作步骤都能运行8个线程。

I have access to 11 nodes on the cluster each capable of 32 threads and I can't seem to get the SBATCH configuration correct. 我可以访问群集中的11个节点,每个节点具有32个线程,而且我似乎无法正确获取SBATCH配置。 It seems to either run one job per node or run over 200 of the jobs across all nodes. 它似乎要么在每个节点上运行一个作业,要么在所有节点上运行200个以上的作业。

I have tried a number of different configs and was wondering if anyone has some suggestions. 我尝试了许多不同的配置,并且想知道是否有人提出建议。

From your description, you should use 根据您的描述,您应该使用

--array 0-654
--ntasks 1
--cpus-per-task 8

but the outcome will depend on the cluster configuration (notably whether or not the nodes can be shared among nodes, presence of limits on the number of jobs, etc.) 但是结果将取决于群集配置(尤其是节点之间是否可以共享节点,作业数量是否存在限制等)。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM