How to forward-fill non-null values in a pandas DataFrame based on a set condition

2 Answers

User

Answer 1 · edited 2024-04-26 02:33:19

One way is to replace the zeros with NaN and then bfill:

In [11]: df.replace(0, np.nan).bfill()  # maybe neater way to do this?
Out[11]:
             a   b   c
2000-03-02   1   1   1
2000-03-03   1   1   1
2000-03-04   1   1   1
2000-03-05   1 NaN NaN
2000-03-06 NaN NaN NaN
2000-03-07 NaN NaN NaN

Now you can use where to change these to 2:

df.where(df.replace(0, np.nan).bfill(), 2)
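As a minimal sketch of the replace / bfill / where steps: the question's original data is not shown on this page, so the frame below is hypothetical, chosen only so that the bfill result matches the output above. Newer pandas versions may reject a non-boolean condition in where, so this sketch adds .notna() to turn the back-filled frame into an explicit boolean mask; that extra step is an assumption, not part of the original answer.

```python
import numpy as np
import pandas as pd

# Hypothetical 0/1 frame (assumed; the question's real data is not shown).
index = pd.date_range("2000-03-02", periods=6)
df = pd.DataFrame(
    {
        "a": [0, 1, 1, 1, 0, 0],
        "b": [0, 1, 1, 0, 0, 0],
        "c": [1, 1, 1, 0, 0, 0],
    },
    index=index,
)

# Zeros become NaN, then each NaN is back-filled from the next non-NaN value below it.
# Whatever is still NaN afterwards has no 1 at or below it in that column.
has_one_at_or_below = df.replace(0, np.nan).bfill().notna()

# Keep df where the mask is True; the trailing zeros (mask False) become 2.
result = df.where(has_one_at_or_below, 2)
print(result)
```

Zeros that come before a later 1 are left alone; only the run of zeros after the last 1 in each column is changed.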

Edit: using a cumsum trick here may be faster:

In [21]: %timeit df.where(df.replace(0, np.nan).bfill(), 2)
100 loops, best of 3: 2.34 ms per loop

In [22]: %timeit df.where(df[::-1].cumsum()[::-1], 2)
1000 loops, best of 3: 1.7 ms per loop

In [23]: %timeit pd.DataFrame(np.where(np.cumsum(df.values[::-1], 0)[::-1], df.values, 2), df.index)
10000 loops, best of 3: 186 µs per loop
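Why the cumsum trick works: a cumulative sum taken from the bottom up counts how many ones sit at or below each row, so a count of zero marks exactly the trailing zeros. Here is a rough sketch of both the pandas and the NumPy variant, again on a hypothetical frame and assuming the values are only 0 and 1. The explicit > 0 comparison and the columns= argument are additions for this sketch, not part of the timed expressions.

```python
import numpy as np
import pandas as pd

# Hypothetical 0/1 frame (assumed; the question's real data is not shown).
index = pd.date_range("2000-03-02", periods=6)
df = pd.DataFrame(
    {
        "a": [0, 1, 1, 1, 0, 0],
        "b": [0, 1, 1, 0, 0, 0],
        "c": [1, 1, 1, 0, 0, 0],
    },
    index=index,
)

# pandas variant: reverse the rows, cumulative-sum, reverse back.
# Each cell then holds the number of ones at or below that row.
ones_remaining = df[::-1].cumsum()[::-1]
result = df.where(ones_remaining > 0, 2)

# NumPy variant: same idea on the raw array, which skips index alignment.
values = df.to_numpy()
remaining = np.cumsum(values[::-1], axis=0)[::-1]
result_np = pd.DataFrame(
    np.where(remaining > 0, values, 2),
    index=df.index,
    columns=df.columns,
)
print(result_np)
```

The NumPy version is probably faster for the reason the timings suggest: it avoids building intermediate DataFrames and aligning them.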

User

Answer 2 · edited 2024-04-26 02:33:19

This is not a very general solution (for example, it will fail if the index is non-contiguous). The first part, getting the indexer, is quite a hassle!

In [64]: indexer = pd.Series(df.index.get_indexer(df.diff().idxmin().values), index=df.columns)

In [65]: indexer
Out[65]: 
a    4
b    3
c    3
dtype: int64

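I think there is a vectorized way to do this: all you have to do is construct the correct boolean matrix from the indexer above, but it makes my brain hurt.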

^{pr2}$
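One possible way to build that boolean matrix is NumPy broadcasting: compare every row position against each column's cutoff position. This is only a sketch and not necessarily what the answer originally did. The frame is hypothetical, and it assumes each column drops from 1 to 0 exactly once and then stays at 0 (a column with no drop would get a wrong cutoff from idxmin).

```python
import numpy as np
import pandas as pd

# Hypothetical 0/1 frame (assumed; the question's real data is not shown).
index = pd.date_range("2000-03-02", periods=6)
df = pd.DataFrame(
    {
        "a": [0, 1, 1, 1, 0, 0],
        "b": [0, 1, 1, 0, 0, 0],
        "c": [1, 1, 1, 0, 0, 0],
    },
    index=index,
)

# Same indexer as in the answer: for each column, the row position
# where the value drops from 1 to 0 (the most negative diff).
indexer = pd.Series(
    df.index.get_indexer(df.diff().idxmin().values),
    index=df.columns,
)

# Broadcast a column vector of row positions against a row vector of cutoffs.
# True means the row lies before that column's cutoff.
row_positions = np.arange(len(df))[:, None]
before_cutoff = row_positions < indexer.to_numpy()[None, :]

mask = pd.DataFrame(before_cutoff, index=df.index, columns=df.columns)
result = df.where(mask, 2)
print(result)
```

Because the mask is built from integer positions rather than labels, it does not depend on the dates being evenly spaced, though it still relies on the single-drop assumption above.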
