用不同大小的数据帧划分pandas中的列

df5 = dataFrame[['PdDistrict' , 'Category']] df5 = df5[pd.notnull(df5['PdDistrict'])] df5 = df5.groupby(['Category', 'PdDistrict']).size() df5 = df5.reset_index() df5 = df5.sort_values(['PdDistrict',0], ascending=False) df6 = df5.groupby('PdDistrict')[0].sum() df6 = df6.reset_index()

'Category' 'PdDistrict' 'count' 'Average' Drugs Bayview 200 0.33 Theft Bayview 200 0.33 Gambling Bayview 200 0.33 Drugs CENTRAL 200 0.22 Theft CENTRAL 200 0.22 Gambling CENTRAL 200 0.22

1条回答

网友

1楼 · 发布于 2024-06-16 11:00:50

您可以将total count列添加到第一个df中，然后可以执行计算：

In [45]:
df['total count'] = df['PdDistrict'].map(df1.set_index('PdDistrict')['total count'])
df

Out[45]:
   Category PdDistrict  count  total count
0     Drugs    Bayview    200          600
1     Theft    Bayview    200          600
2  Gambling    Bayview    200          600
3     Drugs    CENTRAL    300          900
4     Theft    CENTRAL    300          900
5  Gambling    CENTRAL    300          900

In [46]:
df['Average'] = df['count']/df['total count']
df

Out[46]:
   Category PdDistrict  count  total count   Average
0     Drugs    Bayview    200          600  0.333333
1     Theft    Bayview    200          600  0.333333
2  Gambling    Bayview    200          600  0.333333
3     Drugs    CENTRAL    300          900  0.333333
4     Theft    CENTRAL    300          900  0.333333
5  Gambling    CENTRAL    300          900  0.333333

相关问题更多 >

编程相关推荐

热门问题

热门文章