Python在多个Groupby上执行Max函数

import pandas as pd import numpy as np data = {'Supplier':['Tom', 'Tom', 'Tom', 'Bill','Bill','Bill'],'Item':['Wood','Wood','Steel','Steel','Steel','Wood'],'Price':[42,33,24,16,12,18]} df = pd.DataFrame(data) df['Opp_Item'] = np.where(df['Item']=="Wood", "Steel", "Wood") df['Opp_Item_Max'] = df.groupby(['Supplier','Opp_Item'])['Price'].transform(max) print(df) Supplier Item Price Opp_Item Opp_Item_Max 0 Tom Wood 42 Steel 42 1 Tom Wood 33 Steel 42 2 Tom Steel 24 Wood 24 3 Bill Steel 16 Wood 16 4 Bill Steel 12 Wood 16 5 Bill Wood 18 Steel 18

2条回答

网友

1楼 · 编辑于 2024-06-08 06:51:03

如果您可以找到每个供应商+项目的最大值，那么您只需交换这些值，并通过join将它们分配回：

v = df.groupby(['Supplier', 'Item']).Price.max().unstack(-1)
# This reversal operation works under the assumption that
# there are only two items and that they are opposites of each other. 
v[:] = v.values[:, ::-1]  

df = (df.set_index(['Supplier', 'Item'])
        .join(v.stack().to_frame('Opp_Item_Max'), how='left')
        .reset_index())

print(df)
  Supplier   Item  Price  Opp_Item_Max
0     Bill  Steel     16            18
1     Bill  Steel     12            18
2     Bill   Wood     18            16
3      Tom  Steel     24            42
4      Tom   Wood     42            24
5      Tom   Wood     33            24

注意：连接后将不会保留数据的顺序。你知道吗

网友

2楼 · 编辑于 2024-06-08 06:51:03

您可以map将groupby前面的值转换为相反的值，然后将其合并回原始数据帧。你知道吗

d = {'Steel': 'Wood', 'Wood': 'Steel'}

df.merge(df.assign(Item = df.Item.map(d))
           .groupby(['Supplier', 'Item'], as_index=False).max(),
         on=['Supplier', 'Item'],  
         how='left',
         suffixes=['', '_Opp_Item'])

  Supplier   Item  Price  Price_Opp_Item
0      Tom   Wood     42              24
1      Tom   Wood     33              24
2      Tom  Steel     24              42
3     Bill  Steel     16              18
4     Bill  Steel     12              18
5     Bill   Wood     18              16

相关问题更多 >

编程相关推荐

热门问题

热门文章