python multiprocessing .join() 死锁依赖于worker函数 [英] python multiprocessing .join() deadlock depends on worker function

查看：38 发布时间：2022/1/12 12:50:50 python join multiprocessing python-multiprocessing

本文介绍了python multiprocessing .join() 死锁依赖于worker函数的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在使用 multiprocessing python生成库 4 Process() 对象以并行化 CPU 密集型任务.任务(灵感和代码来自这个伟大的 article) 是计算列表中每个整数的素因子.

I am using the multiprocessing python library to spawn 4 Process() objects to parallelize a cpu intensive task. The task (inspiration and code from this great article) is to compute the prime factors for every integer in a list.

main.py:

import random
import multiprocessing
import sys

num_inputs  = 4000
num_procs   = 4
proc_inputs = num_inputs/num_procs
input_list  = [int(1000*random.random()) for i in xrange(num_inputs)]

output_queue = multiprocessing.Queue()
procs        = []
for p_i in xrange(num_procs):
  print "Process [%d]"%p_i
  proc_list = input_list[proc_inputs * p_i:proc_inputs * (p_i + 1)]
  print " - num inputs: [%d]"%len(proc_list)

  # Using target=worker1 HANGS on join
  p = multiprocessing.Process(target=worker1, args=(p_i, proc_list, output_queue))
  # Using target=worker2 RETURNS with success
  #p = multiprocessing.Process(target=worker2, args=(p_i, proc_list, output_queue))

  procs.append(p)
  p.start()

for p in jobs:
  print "joining ", p, output_queue.qsize(), output_queue.full()
  p.join()
  print "joined  ", p, output_queue.qsize(), output_queue.full()

print "Processing complete."
ret_vals = []
while output_queue.empty() == False:
    ret_vals.append(output_queue.get())
print len(ret_vals)
print sys.getsizeof(ret_vals)

观察:

如果每个进程的目标是函数 worker1，对于大于 4000 个元素的输入列表，主线程会卡在 .join() 上，等待产生的进程终止并且永不返回.
如果每个进程的目标是函数 worker2，对于相同的输入列表，代码工作正常，主线程返回.

If the target for each process is the function worker1, for an input list larger than 4000 elements the main thread gets stuck on .join(), waiting for the spawned processes to terminate and never returns.
If the target for each process is the function worker2, for the same input list the code works just fine and the main thread returns.

这让我很困惑，因为 worker1 和 worker2 之间的唯一区别(见下文)是前者在 Queue 而后者为每个进程插入一个列表.

This is very confusing to me, as the only difference between worker1 and worker2 (see below) is that the former inserts individual lists in the Queue whereas the latter inserts a single list of lists for each process.

为什么使用 worker1 而没有使用 worker2 目标会出现死锁?两者(或都不)不应该超出多处理队列最大大小限制为 32767?

Why is there deadlock using worker1 and not using worker2 target? Shouldn't both (or neither) go beyond the Multiprocessing Queue maxsize limit is 32767?

worker1 与 worker2:

def worker1(proc_num, proc_list, output_queue):
    '''worker function which deadlocks'''  
    for num in proc_list:
        output_queue.put(factorize_naive(num))

def worker2(proc_num, proc_list, output_queue):
    '''worker function that works'''
    workers_stuff = []

    for num in proc_list:
        workers_stuff.append(factorize_naive(num))
    output_queue.put(workers_stuff)

<小时>

关于 SO 有很多类似的问题，但我相信这些问题的核心显然与所有问题不同.

There are a lot of similar questions on SO, but I believe the core of this questions is clearly distinct from all of them.

相关链接:

python multiprocessing .join() 死锁依赖于worker函数 [英] python multiprocessing .join() deadlock depends on worker function

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

python multiprocessing .join() 死锁依赖于worker函数 [英] python multiprocessing .join() deadlock depends on worker function

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭