PandasGroupBy计算满足一定条件的加权百分比

weight race Question_1 Question_2 Question_3 0.9 white 1 5 4 1.1 asian 5 4 3 0.95 white 2 1 5 1.25 black 5 4 3 0.80 other 4 5 2

Question_1 Question_2 Question_3 white 0.00 0.49 0.51 black 1.00 0.00 0.00 asian 1.00 0.00 0.00 other 0.00 1.00 0.00

2条回答

网友

1楼 · 编辑于 2024-05-23 21:42:04

这里有一个解决方案，通过定义一个自定义函数并将该函数应用于每个列。然后，您可以将每个列连接到一个数据帧中：

def wavg(x, col):
    return (x['weight']*(x[col]==5)).sum()/x['weight'].sum()

grouped = df.groupby('race')
pd.concat([grouped.apply(wavg,col) for col in df.columns if col.startswith('Question')],axis=1)\
    .rename(columns = {num:f'Question_{num+1}' for num in range(3)})

输出：

^{pr2}$

网友

2楼 · 编辑于 2024-05-23 21:42:04

下面是问题1的答案。你可以很容易地把它推广到其他问题上。在

# Define a dummy indicating a '5 response'
df['Q1'] = np.where(df['Question_1']==5 ,1, 0)

# Create a weighted version of the above dummy
df['Q1_w'] = df['Q1'] * df['weight']

# Compute the sum by race
ds = df.groupby(['race'])[['Q1_w', 'weight']].sum()

# Compute the weighted average
ds['avg'] = ds['Q1_w'] / ds['weight']

基本上，你首先用种族来计算权重和权重的总和，然后除以权重之和。这就是加权平均数。在

相关问题更多 >

编程相关推荐

热门问题

热门文章

PandasGroupBy计算满足一定条件的加权百分比

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >