Pivot table where identical column names must repeat after the pivot

user  region  attribute    reading
Jon   Europe  fathername   peter
Jon   Europe  age          50
Jon   Europe  mothername   mary
Jon   Europe  age          44
Jon   Europe  brothername  duke
Jon   Europe  age          25

1 answer

User

#1 · Posted 2024-06-01 01:20:33

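First, it is best to avoid duplicate column names. One possible solution is therefore to make the repeated attribute values unique, by appending a counter to them, before pivoting: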

print(df)
    user  region    attribute reading
0    Jon  Europe   fathername   peter
1    Jon  Europe          age      50
2    Jon  Europe   mothername    mary
3    Jon  Europe          age      44
4    Jon  Europe  brothername    duke
5    Jon  Europe          age      25
6   Jon1  Europe   fathername   peter
7   Jon1  Europe          age      50
8   Jon1  Europe   mothername    mary
9   Jon1  Europe          age      44
10  Jon1  Europe  brothername    duke
11  Jon1  Europe          age      25

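# Flag rows whose (user, region, attribute) combination occurs more than once
m = df.duplicated(['user','region', 'attribute'], keep=False)
# Append a running counter to the flagged attributes: age -> age0, age1, age2
df.loc[m, 'attribute'] += df[m].groupby(['user','region', 'attribute']).cumcount().astype(str)

# Pivot, then restore the order in which the attributes first appear
df = df.pivot_table(index=['user','region'],
                    columns='attribute',
                    values='reading',
                    aggfunc='sum').reindex(df['attribute'].unique(), axis=1)
print(df)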
attribute   fathername age0 mothername age1 brothername age2
user region                                                 
Jon  Europe      peter   50       mary   44        duke   25
Jon1 Europe      peter   50       mary   44        duke   25
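
For anyone who wants to run this end to end, here is a minimal self-contained sketch of the same idea. It rebuilds the sample data for a single user and stores every reading as a string, which is an assumption about the original data. It also swaps in aggfunc='first' for 'sum', since each cell holds exactly one value once the attributes are unique. The names keys, order and wide are only illustrative.

```python
import pandas as pd

# Sample data rebuilt from the question (one user only, readings kept as strings).
df = pd.DataFrame({
    "user": ["Jon"] * 6,
    "region": ["Europe"] * 6,
    "attribute": ["fathername", "age", "mothername", "age", "brothername", "age"],
    "reading": ["peter", "50", "mary", "44", "duke", "25"],
})

keys = ["user", "region", "attribute"]

# Flag every row whose (user, region, attribute) combination occurs more than once.
m = df.duplicated(keys, keep=False)

# Number the repeats within each group (0, 1, 2, ...) and append the counter,
# turning the three "age" rows into "age0", "age1", "age2".
df.loc[m, "attribute"] += df[m].groupby(keys).cumcount().astype(str)

# Remember the order in which the attributes first appear.
order = df["attribute"].unique()

# Assumption: "first" replaces the answer's "sum"; every (index, column) pair
# now has exactly one reading, so "first" simply picks it.
wide = df.pivot_table(
    index=["user", "region"],
    columns="attribute",
    values="reading",
    aggfunc="first",
).reindex(order, axis=1)

print(wide)
```

pivot_table sorts the new columns alphabetically, so the final reindex is what puts each age column back next to the name it belongs to.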
