用groupby计算数据帧上的累积移动平均

time rssi key1 key2 CMA 0 0.021 -71 P A NaN 1 0.022 -60 Q A NaN 2 0.025 -56 P B NaN 3 0.12 -70 Q B NaN 4 0.167 -65 P A NaN 5 0.210 -55 P B NaN 6 0.211 -74 Q A NaN 7 0.213 -62 Q B NaN ...

time rssi key1 key2 CMA 0 0.021 -71 P A NaN 1 0.022 -60 Q A NaN 2 0.025 -56 P B NaN 3 0.12 -70 Q B NaN 4 0.167 -65 P A -68 5 0.210 -55 P B -55.5 6 0.211 -74 Q A -67 7 0.213 -62 Q B -66 ...

import pandas as pd import numpy as np df = pd.DataFrame() df['time'] = [0.021,0.022,0.025,0.12,0.167,0.210,0.211,0.213] df['rssi'] = [-71,-60,-56,-70,-65,-55,-74,-62] df['key1'] = ['P','Q','P','Q','P','P','Q','Q'] df['key2'] = ['A','A','B','B','A','B','A','B'] df["CMA"] = np.nan for key, grp in df.groupby(['key1', 'key2']): i = 0 old_index = 0 for index, row in grp.iterrows(): if i == 0: # allowed alternative df.at[index,'CMA'] = grp.at[index,'rssi'] old_index = index else: df.at[index,'CMA'] = ((df.at[old_index,'CMA'] * i) + df.at[index,'rssi']) / (i+1) old_index = index i += 1 print df

1条回答

网友
1楼 · 发布于 2024-05-19 01:43:59

您可以使用reset_index执行groupby().expanding().mean()：
df['CMA'] = (df.groupby(['key1','key2'], as_index=False)['rssi'] .expanding(min_periods=2).mean() .reset_index(level=0, drop=True) )
输出：
time rssi key1 key2 CMA 0 0.021 -71 P A NaN 1 0.022 -60 Q A NaN 2 0.025 -56 P B NaN 3 0.120 -70 Q B NaN 4 0.167 -65 P A -68.0 5 0.210 -55 P B -55.5 6 0.211 -74 Q A -67.0 7 0.213 -62 Q B -66.0

相关问题更多 >

编程相关推荐

热门问题

热门文章