如何使用GROUPBY获取唯一标识的累计和？

Date Time ID Weight Jul-1 12:00 A 10 Jul-1 12:00 B 20 Jul-1 12:00 C 100 Jul-1 12:10 C 100 Jul-1 12:10 D 30 Jul-1 12:20 C 100 Jul-1 12:20 D 30 Jul-1 12:30 A 10 Jul-1 12:40 E 40 Jul-1 12:50 F 50 Jul-1 1:00 A 40

1条回答

网友

1楼 · 发布于 2024-05-13 22:19:59

下面的代码使用pandas.duplicate()、pandas.merge()、pandas.groupby/sum和{a4}来获得所需的输出：

# creates a series of weights to be considered and rename it to merge
unique_weights = df['weight'][~df.duplicated(['weight'])]
unique_weights.rename('consider_cum', inplace = True)

# merges the series to the original dataframe and replace the ignored values by 0
df = df.merge(unique_weights.to_frame(), how = 'left', left_index=True, right_index=True)
df.consider_cum = df.consider_cum.fillna(0)

# sums grouping by date and time
df = df.groupby(['date', 'time']).sum().reset_index()

# create the cumulative sum column and present the output
df['weight_cumsum'] = df['consider_cum'].cumsum()
df[['date', 'time', 'weight_cumsum']]

生成以下输出：

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何使用GROUPBY获取唯一标识的累计和？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >