如果在数据帧中找到行的列值，请从该行中删除该行

df1 = { 'vouchers': [100, 200, 300, 400], 'units': [11, 12, 12, 13], 'some_other_data': ['a', 'b', 'c', 'd'], } df2 = { 'vouchers': [500, 200, 600, 300], 'units': [11, 12, 12, 13], 'some_other_data': ['b', 'd', 'c', 'a'], }

3条回答

网友

1楼 · 编辑于 2024-04-18 06:06:54

使用pd.Index.isin可以有效地完成索引操作：

u = df1.set_index(['vouchers', 'units'])
df1[~u.index.isin(pd.MultiIndex.from_arrays([df2.vouchers, df2.units]))]

   vouchers  units some_other_data
0       100     11               a
2       300     12               c
3       400     13               d

网友

2楼 · 编辑于 2024-04-18 06:06:54

使用mergeindicator，在我们得到需要删除的index之后，使用drop

idx=df1.merge(df2,on=['vouchers','units'],indicator=True,how='left').\
     loc[lambda x : x['_merge']=='both'].index
df1=df1.drop(idx,axis=0)
df1
Out[374]: 
   vouchers  units some_other_data
0       100     11               a
2       300     12               c
3       400     13               d

网友

3楼 · 编辑于 2024-04-18 06:06:54

虽然我们有很多好的答案，但问题似乎很有趣，因此作为学习，我承认这是非常有兴趣的，并想提出另一个版本，它看起来有点简单，使用布尔表达式：

第一个数据帧：

>>> df1
   vouchers  units some_other_data
0       100     11               a
1       200     12               b
2       300     12               c
3       400     13               d

第二个数据帧：

^{pr2}$

可能更简单的答案：

>>> df1[(df1 != df2).any(1)]
   vouchers  units some_other_data
0       100     11               a
2       300     12               c
3       400     13               d

解决方案2:使用merge+indicator+query

>>> df1.merge(df2, how='outer', indicator=True).query('_merge == "left_only"').drop('_merge', 1)
   vouchers  units some_other_data
0       100     11               a
2       300     12               c
3       400     13               d

解决方案3:

>>> df1[~df1.isin(df2).all(axis=1)]
   vouchers  units some_other_data
0       100     11               a
2       300     12               c
3       400     13               d

相关问题更多 >

编程相关推荐

热门问题

热门文章