pandas从列中删除特定序列

2条回答

网友

1楼 · 编辑于 2024-04-25 14:53:53

有一种方法：

import numpy as np
import pandas as pd

def find_drops(seq, df):
    if seq:
        m = np.logical_and.reduce([df.num.shift(-i).eq(seq[i]) for i in range(len(seq))])
        if len(seq) == 1:
            return pd.Series(m, index=df.index)
        else:
            return pd.Series(m, index=df.index).replace({False: np.NaN}).ffill(limit=len(seq)-1).fillna(False)
    else:
        return pd.Series(False, index=df.index)


find_drops([1], df)
#0     True
#1     True
#2    False
#3    False
#4     True
#5    False
#6    False
#7    False
#dtype: bool

find_drops([1,1,2,3], df)
#0     True
#1     True
#2     True
#3     True
#4    False
#5    False
#6    False
#7    False
#dtype: bool

然后使用这些序列来切片df[~find_drops([1,5], df)]

网友

2楼 · 编辑于 2024-04-25 14:53:53

你看了^{}了吗？默认值为keep=first。所以你可以简单地做：

datatry.loc[datatry['num'].duplicated(), :]

相关问题更多 >

编程相关推荐

热门问题

热门文章

pandas从列中删除特定序列

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >