使用多个布尔列筛选pandas数据帧

网友

1楼 · 编辑于 2024-05-23 21:58:33

In [82]: d
Out[82]:
             A   B      C      D
0     John Doe  45   True  False
1   Jane Smith  32  False  False
2  Alan Holmes  55  False   True
3   Eric Lamar  29   True   True

解决方案1：

In [83]: d.loc[d.C | d.D]
Out[83]:
             A   B      C      D
0     John Doe  45   True  False
2  Alan Holmes  55  False   True
3   Eric Lamar  29   True   True

解决方案2：

In [94]: d[d[['C','D']].any(1)]
Out[94]:
             A   B      C      D
0     John Doe  45   True  False
2  Alan Holmes  55  False   True
3   Eric Lamar  29   True   True

解决方案3：

In [95]: d.query("C or D")
Out[95]:
             A   B      C      D
0     John Doe  45   True  False
2  Alan Holmes  55  False   True
3   Eric Lamar  29   True   True

PS如果您将解决方案更改为：

df[(df['C']==True) | (df['D']==True)]

也会有用的

Pandas docs - boolean indexing

why we should NOT use "PEP complaint" df["col_name"] is True instead of df["col_name"] == True?

In [11]: df = pd.DataFrame({"col":[True, True, True]})

In [12]: df
Out[12]:
    col
0  True
1  True
2  True

In [13]: df["col"] is True
Out[13]: False               # <----- oops, that's not exactly what we wanted

网友

2楼 · 编辑于 2024-05-23 21:58:33

万岁！更多选择！

`np.where`

df[np.where(df.C | df.D, True, False)]

             A   B      C      D
0     John Doe  45   True  False
2  Alan Holmes  55  False   True
3   Eric Lamar  29   True   True

`pd.Series.where`在`df.index`

df.loc[df.index.where(df.C | df.D).dropna()]

               A   B      C      D
0.0     John Doe  45   True  False
2.0  Alan Holmes  55  False   True
3.0   Eric Lamar  29   True   True

`df.select_dtypes`

df[df.select_dtypes([bool]).any(1)]   

             A   B      C      D
0     John Doe  45   True  False
2  Alan Holmes  55  False   True
3   Eric Lamar  29   True   True

滥用`np.select`

df.iloc[np.select([df.C | df.D], [df.index])].drop_duplicates()

             A   B      C      D
0     John Doe  45   True  False
2  Alan Holmes  55  False   True
3   Eric Lamar  29   True   True

网友

3楼 · 编辑于 2024-05-23 21:58:33

或者

d[d.eval('C or D')]

Out[1065]:
             A   B      C      D
0     John Doe  45   True  False
2  Alan Holmes  55  False   True
3   Eric Lamar  29   True   True

`np.where`

`pd.Series.where`在`df.index`

`df.select_dtypes`

滥用`np.select`

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用多个布尔列筛选pandas数据帧

np.where

pd.Series.where在df.index

df.select_dtypes

滥用np.select

相关问题 更多 >

编程相关推荐

热门问题

热门文章

`np.where`

`pd.Series.where`在`df.index`

`df.select_dtypes`

滥用`np.select`

相关问题更多 >