
Can I specify a max memory for the default dask scheduler on a shared machine?

I am working on a shared analysis node at Princeton University.

I often have my dask processes killed due to high memory consumption. This appears to be a precaution on the admins' side to keep the shared system stable.

To control resource usage I normally use a LocalCluster via dask.distributed, but in this particular case that prevents me from using a numerically efficient algorithm implemented with numba (see here for a discussion of the problem).
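For reference, this is roughly the pattern I normally rely on; memory_limit is the real LocalCluster parameter for capping each worker, and the worker counts and sizes here are only examples:

    # A per-worker memory cap with dask.distributed (sizes are illustrative).
    from dask.distributed import Client, LocalCluster

    cluster = LocalCluster(n_workers=4, threads_per_worker=1, memory_limit="4GB")
    client = Client(cluster)  # workers spill/pause/restart as they near the limit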

I did find an answer here for specifying the number of threads to use, but is there a similar way to specify a maximum amount of memory for the threaded scheduler?
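For reference, the thread count can be set per call or globally (both are documented Dask options; the array shape and thread counts are only examples):

    import dask
    import dask.array as da
    from multiprocessing.pool import ThreadPool

    x = da.random.random((20000, 20000), chunks=(1000, 1000))

    # Per call: limit the threaded scheduler to 4 worker threads.
    total = x.sum().compute(scheduler="threads", num_workers=4)

    # Globally: hand the threaded scheduler a fixed-size pool.
    dask.config.set(pool=ThreadPool(4))
    total = x.sum().compute(scheduler="threads")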

No, Dask will not control the memory of the scheduler process. If it is growing large in memory, that is a sign that you're probably misusing it a bit; ideally the scheduler never stores any of your data.
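If a hard cap is needed anyway, one blunt fallback that is independent of Dask is an OS-level limit on the process's own address space (Unix only); a minimal sketch, with the limit value purely illustrative:

    import resource

    # Cap this process's virtual address space at 8 GiB (Unix only).
    # Allocations beyond the cap fail with MemoryError inside this process,
    # rather than growing until the shared machine's admins kill the job.
    limit_bytes = 8 * 1024**3  # example value
    resource.setrlimit(resource.RLIMIT_AS, (limit_bytes, limit_bytes))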
