为什么这个multiprocessing.pool的实现不工作?

5 投票
1 回答
6777 浏览
提问于 2025-04-18 13:11

这是我正在使用的代码:

def initFunction(arg1, arg2):
    def funct(value):
        return arg1 * arg2 * value
    return funct

os.system("taskset -p 0xff %d" % os.getpid()) 
pool = Pool(processes=4)
t = np.linspace(0,1,10e3)

a,b,c,d,e,f,g,h = sy.symbols('a,b,c,d,e,f,g,h',commutative=False)

arg1 = sy.Matrix([[a,b],[c,d]])
arg2 = sy.Matrix([[e,f],[g,h]])
myFunct = initFunction(arg1, arg2)

m3 = map(myFunct,t) # this works
m4 = pool.map(myFunct,t) # this does NOT work

我遇到的错误是:

Traceback (most recent call last):
   File "<stdin>", line 1, in <module>
   File "/usr/lib/python2.7/dist-packages/spyderlib/widgets/externalshell/sitecustomize.py", line 540, in runfile
      execfile(filename, namespace)
   File "/home/justin/Research/mapTest.py", line 46, in <module>
      m4 = pool.map(myFunct,t) 
   File "/usr/lib/python2.7/multiprocessing/pool.py", line 251, in map
      return self.map_async(func, iterable, chunksize).get()
   File "/usr/lib/python2.7/multiprocessing/pool.py", line 558, in get
      raise self._value
cPickle.PicklingError: Can't pickle <type 'function'>: attribute lookup __builtin__.function failed

那么这个错误是什么意思呢?我该如何让这个map函数支持多进程呢?

1 个回答

7

在使用 multiprocessing 时,你传递给不同进程的对象必须能够从 __main__ 模块中导入,这样它们才能在子进程中被解包。像 funct 这样的嵌套函数是无法从 __main__ 导入的,所以你会遇到那个错误。你可以通过使用 functools.partial 来实现你想要的效果:

from multiprocessing import Pool
from functools import partial

def funct(arg1, arg2, value):
    return arg1 * arg2 * value


if __name__ == "__main__":
    t = [1,2,3,4]
    arg1 = 4 
    arg2 = 5 

    pool = Pool(processes=4)
    func = partial(funct, arg1, arg2)
    m4 = pool.map(func,t)
    print(m4)

输出:

[20, 40, 60, 80]

撰写回答