numpy高级索引：透明优化范围？

>>> a = np.arange(1_000_000) >>> direct = lambda: np.sum(a[:]) >>> indirect = lambda: np.sum(a[a]) >>> timeit(direct, number=100) 0.07656216900795698 >>> timeit(indirect, number=100) 0.2885982050211169

class smart_idx: def __init__(self, n): self.n = n def __getitem__(self, idx): idx = idx if isinstance(idx, tuple) else (idx,) if idx: count = idx.count('X') need_adv = count > 1 if count == 1: for i in idx: if not isinstance(i, slice) and i != Ellipsis: need_adv = True break repl = np.arange(self.n) if need_adv else slice(None) return tuple(repl if i == 'X' else i for i in idx) return slice(None)

1条回答

网友

1楼 · 发布于 2024-06-09 21:55:03

In [104]: x=np.arange(12).reshape(4,3)

虽然一个是副本，另一个是视图，但它们看起来是一样的：

In [107]: x[np.arange(0,4,2),:]
Out[107]: 
array([[0, 1, 2],
       [6, 7, 8]])
In [108]: x[0:4:2,:]
Out[108]: 
array([[0, 1, 2],
       [6, 7, 8]])

但是如果第二个索引是一个数组，那么arange和slice就不是替代品。你知道吗

In [109]: idx=np.array([0,2])
In [110]: x[np.arange(0,4,2),idx]
Out[110]: array([0, 8])
In [111]: x[0:4:2,idx]
Out[111]: 
array([[0, 2],
       [6, 8]])

为了匹配切片版本，我必须向arange添加一个维度。你知道吗

In [113]: x[np.ix_(np.arange(0,4,2),idx)]
Out[113]: 
array([[0, 2],
       [6, 8]])
In [114]: x[np.arange(0,4,2)[:,None],idx]
Out[114]: 
array([[0, 2],
       [6, 8]])

我不知道一个切片表达式会产生Out[110]。你知道吗

因此，除了用slice替换arange之外，我们还需要注意高级索引数组如何相互广播，以及切片意味着什么广播。你知道吗

对于3维或更多维，混合切片和高级索引变得更加复杂，如https://docs.scipy.org/doc/numpy/reference/arrays.indexing.html#combining-advanced-and-basic-indexing中所述

相关问题更多 >

编程相关推荐

热门问题

热门文章