使用2列匹配多列的值

ID Name Match1 Random_Col Match2 Price Match3 Match4 Match5 1 Apple Yes Random Value No 10 Yes Yes Yes 2 Apple Yes Random Value1 No 10 Yes Yes No 3 Apple Yes Random Value2 No 15 No Yes Yes 4 Orange No Random Value Yes 12 Yes Yes No 5 Orange No Random Value Yes 12 No No No 6 Banana Yes Random Value No 15 Yes No No 7 Apple Yes Random Value No 15 No Yes Yes

ID Name Match1 Random_Col Match2 Price Match3 Match4 Match5 Final_Match 1 Apple Yes Random Value No 10 Yes Yes Yes Full 2 Apple Yes Random Value1 No 10 Yes Yes No Partial 3 Apple Yes Random Value2 No 15 No Yes Yes Partial 4 Orange No Random Value Yes 12 Yes Yes No Full 5 Orange No Random Value Yes 12 No No No Partial 6 Banana Yes Random Value No 15 Yes No No Full 7 Apple Yes Random Value No 15 No Yes Yes Partial

1条回答

网友

1楼 · 发布于 2024-06-16 11:29:23

用途：

#count Yes values only in Match columns
s = df.filter(like='Match').eq('Yes').sum(axis=1)
#mask for unique combinations
m1 = ~df.duplicated(['Price','Name'], keep=False)
#create new column filled by Series s
m2 = df.assign(new=s).groupby(['Price','Name'])['new'].rank(ascending=False).eq(1)
#chain masks by bitwise OR
df['final_match'] = np.where(m1 | m2, 'Full ','Partial')
print (df)

   ID    Name Match1     Random_Col Match2  Price Match3 Match4 Match5  \
0   1   Apple    Yes   Random Value     No     10    Yes    Yes    Yes   
1   2   Apple    Yes  Random Value1     No     10    Yes    Yes     No   
2   3   Apple    Yes  Random Value2     No     15     No    Yes    Yes   
3   4  Orange     No   Random Value    Yes     12    Yes    Yes     No   
4   5  Orange     No   Random Value    Yes     12     No     No     No   
5   6  Banana    Yes   Random Value     No     15    Yes     No     No   
6   7   Apple    Yes   Random Value     No     15     No    Yes    Yes   

  final_match  
0       Full   
1     Partial  
2     Partial  
3       Full   
4     Partial  
5       Full   
6     Partial

相关问题更多 >

编程相关推荐

热门问题

热门文章