MultiIndex DataFrame: get only the possible lower-level index values for a given upper-level index value

>>> import numpy as np
>>> import pandas as pd
>>> arrays = [['bar', 'bar', 'baz', 'baz', 'foo', 'foo', 'qux', 'qux'],
...           ['one', 'two', 'one', 'two', 'one', 'two', 'one', 'two'],
...           ['a', 'b', 'a', 'b', 'b', 'b', 'b', 'b']]
>>> tuples = list(zip(*arrays))
>>> index = pd.MultiIndex.from_tuples(tuples, names=['first', 'second', 'third'])
>>> s = pd.Series(np.random.randn(8), index=index)
>>> s
first  second  third
bar    one     a       -0.598684
       two     b        0.351421
baz    one     a       -0.618285
       two     b       -1.175418
foo    one     b       -0.093806
       two     b        1.092197
qux    one     b       -1.515515
       two     b        0.741408
dtype: float64

>>> s.index
MultiIndex(levels=[[u'bar', u'baz', u'foo', u'qux'], [u'one', u'two'], [u'a', u'b']],
           labels=[[0, 0, 1, 1, 2, 2, 3, 3], [0, 1, 0, 1, 0, 1, 0, 1], [0, 1, 0, 1, 1, 1, 1, 1]],
           names=[u'first', u'second', u'third'])

>>> s_foo = s.loc['foo']
>>> s_foo
second  third
one     b       -0.093806
two     b        1.092197
dtype: float64
>>> s_foo.index
MultiIndex(levels=[[u'one', u'two'], [u'a', u'b']],
           labels=[[0, 1], [1, 1]],
           names=[u'second', u'third'])

2 Answers

User

Answer 1 · edited 2024-05-15 17:57:20

You can create s_foo and then explicitly remove the unused levels:

s_foo = s.loc['foo']
s_foo.index = s_foo.index.remove_unused_levels()
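As a rough, self-contained sketch of the approach above (the data uses `np.arange` instead of random values so the result is reproducible, and the variable names `stale_levels`, `pruned_levels`, and `present_values` are illustrative, not from the original post):

```python
import numpy as np
import pandas as pd

# Same index layout as in the question; values are deterministic here.
arrays = [
    ['bar', 'bar', 'baz', 'baz', 'foo', 'foo', 'qux', 'qux'],
    ['one', 'two', 'one', 'two', 'one', 'two', 'one', 'two'],
    ['a', 'b', 'a', 'b', 'b', 'b', 'b', 'b'],
]
index = pd.MultiIndex.from_arrays(arrays, names=['first', 'second', 'third'])
s = pd.Series(np.arange(8.0), index=index)

s_foo = s.loc['foo']

# Levels are stored for the whole original index, so 'third' may still
# list 'a' here even though no row under 'foo' uses it.
stale_levels = list(s_foo.index.levels[1])

# After pruning, the stored levels only contain values that actually occur.
s_foo.index = s_foo.index.remove_unused_levels()
pruned_levels = list(s_foo.index.levels[1])

# Alternative that ignores stored levels entirely: read the values per row.
present_values = list(s_foo.index.get_level_values('third').unique())

print(pruned_levels, present_values)
```

`get_level_values(...).unique()` reads the labels actually present, so it should give the same answer without modifying the index.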

User

Answer 2 · edited 2024-05-15 17:57:20

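Resetting the index looks like the right approach: it seems you don't actually want this level to be part of the index (the result you are seeing is simply how indexes work).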

s.reset_index(level=2).groupby(level=[0])['third'].unique()

Or, if you prefer:

s.reset_index(level=2).groupby(level=[0])['third'].value_counts()
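A minimal runnable sketch of both variants, assuming the same index as in the question. The Series name `'value'` and the variable names are added here for illustration only, and levels are referenced by name rather than by position:

```python
import numpy as np
import pandas as pd

arrays = [
    ['bar', 'bar', 'baz', 'baz', 'foo', 'foo', 'qux', 'qux'],
    ['one', 'two', 'one', 'two', 'one', 'two', 'one', 'two'],
    ['a', 'b', 'a', 'b', 'b', 'b', 'b', 'b'],
]
index = pd.MultiIndex.from_arrays(arrays, names=['first', 'second', 'third'])
s = pd.Series(np.arange(8.0), index=index, name='value')

# Move 'third' out of the index into an ordinary column.
flat = s.reset_index(level='third')

# Distinct 'third' values for each value of the top level 'first'.
per_first = flat.groupby(level='first')['third'].unique()
third_for_foo = list(per_first['foo'])
third_for_bar = list(per_first['bar'])

# Same grouping, but with how often each 'third' value occurs.
counts = flat.groupby(level='first')['third'].value_counts()

print(third_for_foo, third_for_bar)
print(counts)
```

This computes the answer for every top-level value at once, which may be more convenient than slicing with `.loc` one key at a time.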
