简体   繁体   中英

Slurm oversubscribe GPUs

Is there a way to oversubscribe GPUs on Slurm, ie run multiple jobs/job steps that share one GPU? We've only found ways to oversubscribe CPUs and memory, but not GPUs.

We want to run multiple job steps on the same GPU in parallel and optionally specify the GPU memory used for each step.

这样做的最简单的方法是有定义为GPU的feature而不是作为一个gres所以SLURM不会管理的GPU,只要确保工作需要的,提供一个节点一个土地。

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM