重设索引不重设双groupby后的索引

test = pd.DataFrame({'Animal' : ['Falcon', 'Falcon','Parrot', 'Parrot','Mouse','Mouse'],'Type':['Bird', 'Bird', 'Bird', 'Bird', 'Rodent','Rodent'],'Count' : [380., 370., 24., 26., 1.9, 2.8]}) # second groupby gives a proportion of total animal counts within each type gb = test.groupby(['Type','Animal']).sum().groupby(level=0).apply(lambda x: x / float(x.sum()))

3条回答

网友

1楼 · 编辑于 2024-06-06 17:11:08

你知道怎么计算吗？你知道吗

我认为第二个groupby操作不合适：

gb = test.groupby('Animal').sum().groupby(level=0).apply(lambda x: x / float(x.sum()))

试试这个：

gb = test.groupby("Animal").sum().apply(lambda x: x / float(x.sum())).reset_index()

网友

2楼 · 编辑于 2024-06-06 17:11:08

你误读了错误。错误是在索引中找不到“Animal”，而在列中找不到。这里出现的混乱是因为.loc的工作方式。如果只有一个项目传递给.loc，这将被解释为索引。只有第二项用于列。所以你可以用：

gb.loc[:, 'Animal']

但你也可以简单地做到：

gb['Animal']

网友

3楼 · 编辑于 2024-06-06 17:11:08

When I unstack, I'm unable to reset the index so that I can extract the columns
gb.unstack()
gb.loc['Animal']

你可以这样得到“动物”栏： gb.loc[:,'Animal'] 或者 gb['Animal']

相关问题更多 >

编程相关推荐

热门问题

热门文章