如何在python中使用numpy数组进行SIMD处理？

1条回答

网友

1楼 · 发布于 2024-05-16 23:35:44

您可以让每个进程保留自己的输入数组副本。但它们不能写入共享输出数组；这就是使用子进程而不是线程的全部意义。（在线程中，全局解释器锁可能会阻止线程同时运行。）

在Linux（可能还有MacOS）中，初始化的全局变量将由写时复制的子进程继承；只要子进程不尝试写入，变量将使用共享内存。在Windows中，必须为每个工作者初始化此类全局变量

这就是如何做到这一点：

import numpy as np
from multiprocessing import Pool

PERSISTENT_DATA = {}

def func(ij):
    i, j = ij
    return PERSISTENT_DATA['a'][i] + PERSISTENT_DATA['b'][j]

def init_persistent_data(a, b):
    PERSISTENT_DATA['a'] = a
    PERSISTENT_DATA['b'] = b

def run_parallel():
    n, m = 10, 5
    np.random.seed(1)
    a = np.random.randint(10, size=(n, m))
    b = np.random.randint(10, size=(n, m))
    
    # In Linux, these are inherited by the subprocesses.
    init_persistent_data(a, b)
    ij_tuples = [(0, 1), (1, 2)]
    
    # In Linux, leave the initializer and initargs out.
    with Pool(
        processes=4,
        initializer=init_persistent_data, 
        initargs=(a, b)
        ) as pl:
        result = pl.map(func, ij_tuples)
       
    result = np.array(result)


if __name__ == '__main__':
    run_parallel()

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何在python中使用numpy数组进行SIMD处理？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >