Cython函数输出与Python函数输出略有不同

Question

我把一个Python函数转换成了Cython版本，主要是给一些变量加上了类型。不过，Cython函数的输出和原来的Python函数稍微有点不同。

我在这篇文章中了解到了一些导致这种差异的原因 Cython: unsigned int indices for numpy arrays gives different result 但是即使知道了这些，我还是无法让Cython函数的结果和Python函数完全一致。

所以我整理了四个函数，展示我尝试过的内容。有人能帮我找出为什么每个函数的结果会有些不同吗？还有，怎样才能让Cython函数返回和function1完全相同的值呢？我在下面做了一些注释：

%%cython
import numpy as np
cimport numpy as np    

def function1(response, max_loc):    
    x, y = int(max_loc[0]), int(max_loc[1])

    tmp1 = (response[y,x+1] - response[y,x-1]) / 2*(response[y,x] - min(response[y,x-1], response[y,x+1]))
    tmp2 = (response[y,x+1] - response[y,x-1])
    tmp3 = 2*(response[y,x] - min(response[y,x-1], response[y,x+1]))

    print tmp1, tmp2, tmp3        
    return tmp1, tmp2, tmp3

cpdef function2(np.ndarray[np.float32_t, ndim=2] response, np.ndarray[np.float64_t, ndim=1] max_loc):
    cdef unsigned int x, y 
    x, y = int(max_loc[0]), int(max_loc[1])

    tmp1 = (response[y,x+1] - response[y,x-1]) / 2*(response[y,x] - min(response[y,x-1], response[y,x+1]))        
    tmp2 = (response[y,x+1] - response[y,x-1])
    tmp3 = 2*(response[y,x] - min(response[y,x-1], response[y,x+1]))     

    print tmp1, tmp2, tmp3        
    return tmp1, tmp2, tmp3


cpdef function3(np.ndarray[np.float32_t, ndim=2] response, np.ndarray[np.float64_t, ndim=1] max_loc):     
    cdef unsigned int x, y 
    x, y = int(max_loc[0]), int(max_loc[1])

    cdef np.float32_t tmp1, tmp2, tmp3
    cdef np.float32_t r1 =response[y,x+1]
    cdef np.float32_t r2 =response[y,x-1]
    cdef np.float32_t r3 =response[y,x]
    cdef np.float32_t r4 =response[y,x-1]
    cdef np.float32_t r5 =response[y,x+1]    

    tmp1 = (r1 - r2) / 2*(r3 - min(r4, r5))  
    tmp2 = (r1 - r2)
    tmp3 = 2*(r3 - min(r4, r5))

    print tmp1, tmp2, tmp3        
    return tmp1, tmp2, tmp3

def function4(response, max_loc):     
    x, y = int(max_loc[0]), int(max_loc[1])

    tmp1 = (float(response[y,x+1]) - response[y,x-1]) / 2*(float(response[y,x]) - min(response[y,x-1], response[y,x+1]))
    tmp2 = (float(response[y,x+1]) - response[y,x-1])
    tmp3 = 2*(float(response[y,x]) - min(response[y,x-1], response[y,x+1]))

    print tmp1, tmp2, tmp3        
    return tmp1, tmp2, tmp3

max_loc = np.asarray([ 15., 25.], dtype=np.float64) 
response = np.zeros((49,49), dtype=np.float32)     
x, y = int(max_loc[0]), int(max_loc[1])

response[y,x] = 0.959878861904  
response[y,x-1] = 0.438348740339
response[y,x+1] = 0.753262758255  

result1 = function1(response, max_loc)
result2 = function2(response, max_loc)
result3 = function3(response, max_loc)
result4 = function4(response, max_loc)
print result1
print result2
print result3
print result4

然后是结果：

0.0821185777156 0.314914 1.04306030273
0.082118573023 0.314914017916 1.04306024313
0.0821185708046 0.314914017916 1.04306030273
0.082118573023 0.314914017916 1.04306024313
(0.082118577715618812, 0.31491402, 1.043060302734375)
(0.08211857302303427, 0.3149140179157257, 1.0430602431297302)
(0.08211857080459595, 0.3149140179157257, 1.043060302734375)
(0.082118573023034269, 0.31491401791572571, 1.0430602431297302)

function1代表了我在原始Python函数中做的操作。tmp1是结果。

function2是我第一个Cython版本，结果稍微有点不同。显然，如果用带类型的变量（无符号整数或整数）来索引响应数组，结果会被强制转换为双精度浮点数（使用PyFloat_FromDouble），即使数组的类型是np.float32_t。但是如果用Python整数来索引数组，就会使用PyObject_GetItem函数，这样我得到的就是np.float32_t，这和function1的情况一样。因此，function1中的表达式是用np.float32_t类型的操作数计算的，而function2中的表达式是用双精度浮点数计算的。所以我在function1中打印的结果和function2稍微有点不同。

function3是我第二次尝试Cython，想要得到和function1相同的输出。在这里，我使用无符号整数索引来访问响应数组，但结果保留在np.float32_t的中间变量中，然后再用这些变量进行计算。结果还是稍微有点不同。显然，打印语句会使用PyFloat_FromDouble，所以它无法打印np.float32_t。

接着，我尝试把Python函数改成和Cython的版本一致。function4试图通过在每个表达式中至少将一个操作数转换为浮点数来实现，这样其他操作数也会被强制转换为Python浮点数，而在Cython中这就是双精度浮点数，表达式也就用双精度浮点数计算，就像在function2中一样。函数内部的打印结果和function2完全相同，但返回的值却稍微有点不同？！

性能优化浮点数 numpy 数据类型索引 cython 函数输出强制转换

Cython函数输出与Python函数输出略有不同

2 个回答

撰写回答