几个numba函数的并行编译

import numpy as np from numba import jit, float64 @jit(float64[:](float64[:]),nopython=True,cache=True,fastmath=True,parallel=True,nogil=True) def fun_0(x): return np.power(x,0) @jit(float64[:](float64[:]),nopython=True,cache=True,fastmath=True,parallel=True,nogil=True) def fun_1(x): return np.power(x,1) @jit(float64[:](float64[:]),nopython=True,cache=True,fastmath=True,parallel=True,nogil=True) def fun_2(x): return np.power(x,2) @jit(float64[:](float64[:]),nopython=True,cache=True,fastmath=True,parallel=True,nogil=True) def fun_3(x): return np.power(x,3)

1条回答

网友

1楼 · 发布于 2024-05-01 21:51:55

我们的目标是提高运行时性能吗？几千个独立函数的@decorator-“内联”JIT编译会降低运行时性能吗

解决方案：
可以使用“AoT编译：即提前编译代码”

_{While Numba’s main use case is Just-in-Time compilation, it also provides a facility for Ahead-of-Time compilation (AOT).

Limitations :

1) AOT compilation only allows for regular functions, not ufuncs.

2) You have to specify function signatures explicitly.

3) Each exported function can have only one signature (but you can export several different signatures under different names).

4) AOT compilation produces generic code for your CPU’s architectural family (for example “x86-64”), while JIT compilation produces code optimized for your particular CPU model.}

警告：
您的代码按原样似乎容易陷入Amdahl's Law Trap

您将很容易支付更多的附加组件开销，这比从（仅仅）潜在的处理加速中得到的任何回报都要多，因为与更密集的内存I/O和CPU核心缓存重用机制相关的约束（绝对I/O上限和重用效率的相对损失）做并且仍然会违背您提高绩效的愿望）

At least, you have been warned :o)

解决方案：
可以使用“AoT编译：即提前编译代码”

警告：
您的代码按原样似乎容易陷入Amdahl's Law Trap

相关问题更多 >

编程相关推荐

热门问题

热门文章