I use Python's threading.Thread to spawn a thread that runs a small utility for every filename found by os.walk() and collects its output. I tried limiting the number of threads using:
ThreadLimiter = threading.BoundedSemaphore(3)
and
ThreadLimiter.acquire()
at the start of the run method, and
ThreadLimiter.release()
at the end of the run method. But I still get the error message below when I run the program. Any suggestions for improving this?
bash: fork: retry: Resource temporarily unavailable
bash: fork: retry: Resource temporarily unavailable
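For reference, here is a minimal sketch of the pattern described above (class and variable names are illustrative, and the actual utility call is replaced by a placeholder):

```python
import threading

ThreadLimiter = threading.BoundedSemaphore(3)
results = []

class Worker(threading.Thread):
    def __init__(self, filename):
        super().__init__()
        self.filename = filename

    def run(self):
        ThreadLimiter.acquire()       # acquired only AFTER the thread exists
        try:
            # placeholder for running the external utility on self.filename
            results.append(self.filename)
        finally:
            ThreadLimiter.release()

threads = [Worker(fn) for fn in ['a', 'b', 'c', 'd', 'e']]
for t in threads:
    t.start()   # every thread is created up front; the semaphore only gates run()
for t in threads:
    t.join()
```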
Use a thread pool and save yourself a lot of work! Here I md5sum files:
import os
import multiprocessing.pool
import subprocess as subp

def walker(path):
    """Walk the file system, yielding file names."""
    for dirpath, dirs, files in os.walk(path):
        for fn in files:
            yield os.path.join(dirpath, fn)

def worker(filename):
    """Get the md5 sum of one file."""
    p = subp.Popen(['md5sum', filename], stdin=subp.PIPE,
                   stdout=subp.PIPE, stderr=subp.PIPE)
    out, err = p.communicate()
    return filename, p.returncode, out, err

pool = multiprocessing.pool.ThreadPool(3)
for filename, returncode, out, err in pool.imap(worker, walker('.'), chunksize=1):
    print(filename, out.strip())
By the time run executes, the thread has already started, so acquiring the semaphore there only blocks the extra threads, keeping them alive but idle. A limit inside run therefore caps the number of threads finishing, not the number running - it actually makes the problem worse. Either:

- acquire the semaphore before calling start, to delay launching the threads, or
- inside the os.walk loop, keep a list of active threads and block using thread.join when there are too many, or
- use multiprocessing.pool.ThreadPool as shown above.
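The first option - acquiring the semaphore before calling start - could look like this sketch (the filenames list is a stand-in for names produced by os.walk, and the real work is a placeholder):

```python
import threading

limiter = threading.BoundedSemaphore(3)   # at most 3 threads alive at once
results = []

class Worker(threading.Thread):
    def __init__(self, filename):
        super().__init__()
        self.filename = filename

    def run(self):
        try:
            # placeholder for running the external utility on self.filename
            results.append(self.filename)
        finally:
            limiter.release()             # free a slot when this thread finishes

filenames = ['a', 'b', 'c', 'd', 'e']     # stand-in for os.walk() output
threads = []
for fn in filenames:
    limiter.acquire()                     # blocks here until a slot is free
    t = Worker(fn)
    t.start()
    threads.append(t)
for t in threads:
    t.join()
```

Because every acquire happens before the matching thread is created, never more than three threads exist at once, so the fork limit is respected.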