Pandas: comparing specific rows

  winner newcol2
0  white   black
1  black   white
2  white   white
3   draw   white
4  black    draw

conditions1 = [
    (x['winner'] == 'white'),
    (x['winner'] == 'draw'),
    (x['winner'] == 'black')]
conditions2 = [
    (x['newcol2'] == 'white'),
    (x['newcol2'] == 'draw'),
    (x['newcol2'] == 'black')]
x['result'] = np.select(conditions1, conditions2, default='null')

2 answers

User

#1 · Edited 2024-06-07 07:21:38

Beyond the example you gave, I'm not sure what all of your conditions are, but this will work:

In [23]: conditions = []


In [24]: for row in df.itertuples(): 
...:     if row.winner == 'white' and row.newcol2 == 'black': 
...:         conditions.append(1) 
...:     elif row.winner == 'black' and row.newcol2 == 'white': 
...:         conditions.append(1) 
...:     else: 
...:         conditions.append(0) 
...:                                                                        

In [25]: conditions                                                             
Out[25]: [1, 1, 0, 0, 0]

In [26]: df['conditions'] = conditions                                          

In [27]: df                                                                     
Out[27]: 
  winner newcol2  conditions
0  white   black           1
1  black   white           1
2  white   white           0
3   draw   white           0
4  black    draw           0

You can adapt the code to whatever conditions you need.
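The same flag can also be computed without an explicit Python loop. The following is a minimal sketch, not the answerer's code: it assumes the sample frame from the question is rebuilt as df, and the names white_black and black_white are introduced here only for illustration. Each mask combines one condition on winner with one on newcol2, and np.select picks the value for the first mask that matches.

```python
import numpy as np
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "winner":  ["white", "black", "white", "draw", "black"],
    "newcol2": ["black", "white", "white", "white", "draw"],
})

# One boolean mask per (winner, newcol2) pair of interest.
white_black = (df["winner"] == "white") & (df["newcol2"] == "black")
black_white = (df["winner"] == "black") & (df["newcol2"] == "white")

# np.select takes a list of conditions and a list of values of the same length;
# rows matching none of the conditions get the default.
df["conditions"] = np.select([white_black, black_white], [1, 1], default=0)
print(df)
```

On a large frame this generally avoids the per-row overhead of itertuples, and adding another case only means appending one more mask and one more value.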

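User

#2 · Edited 2024-06-07 07:21:38

As I understand it, you want to assign a value to each unique combination of the two columns in the DataFrame.

If not every possible combination occurs in the DataFrame, you can build a dict of codes from the combinations that do occur, as shown here, or generate the dict with itertools: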

combs = set(zip(df['winner'], df['newcol2']))
codes = dict(zip(combs, range(len(combs))))

Use the apply method to replace the combination in the two columns with its code:

df['result'] = df.apply(lambda x: codes[x['winner'], x['newcol2']], axis=1)
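The itertools variant mentioned above is not spelled out in the answer, so here is one way it might look. This is a sketch under assumptions: the list outcomes is a guess that the only possible values are white, black and draw, and the numbering simply follows the order in which itertools.product yields the pairs. Unlike codes built from a set, these codes are the same on every run and also cover pairs that do not occur in the data.

```python
from itertools import product

import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "winner":  ["white", "black", "white", "draw", "black"],
    "newcol2": ["black", "white", "white", "white", "draw"],
})

# Assumed complete set of values for both columns (not stated in the question).
outcomes = ["white", "black", "draw"]

# Number every ordered pair, including pairs that never occur in df.
codes = {pair: code for code, pair in enumerate(product(outcomes, repeat=2))}

# Look up the code for each row's (winner, newcol2) pair.
df["result"] = [codes[pair] for pair in zip(df["winner"], df["newcol2"])]
print(df)
```

The list comprehension over zip does the same lookup as the apply call with axis=1, but without building a Series for every row.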
