Styling a sorted DataFrame in pandas raises a KeyError

Posted 2024-04-25 05:03:17


I want to style a DataFrame after grouping it and sorting it by index. However, the interpreter raises an error:

KeyError: ('Other', 'B')

What is going wrong here?

The code to reproduce the above error:



import pandas as pd
import numpy as np
dict_map=dict(group_one=['D','GG','G'],group_two=['A','C','E','F'])
vv=np.random.randn(5, 4)
# ['foo', '*', 'bar','ff']
nn=np.array([['foo', '*', 'bar','ff'], ['foo', '*', 'bar','**'],
             ['foo', '*', 'bar','**'],['foo', '*', 'bar','ff'],
             ['foo', '*', '**','ff']])
arrays = [["bar", "bar", "baz", "baz"],
          ["one", "two", "one", "two"]]
tuples = list(zip(*arrays))

index = pd.MultiIndex.from_tuples(tuples, names=["first", "second"])
df = pd.DataFrame(nn, index=["A", "B", "C","D",'G'], columns=index)
df = df.rename_axis ( index=['my_ch'] ).reset_index()

d = {i:k for k,v in dict_map.items() for i in v}
out = df.assign(Group=df.xs("my_ch",axis=1).map(d).fillna('Other'))


def highlight_(s):
    return np.select(
        condlist=[s.str.contains('\*\*'), s.str.contains('\*')],
        choicelist=['background-color:green', 'background-color:purple'],
        default='')

df=out.style.apply(highlight_)


df.data=df.data.set_index(['Group', 'my_ch'])

df.data=df.data.sort_index(level=0)
df.to_excel('n1test.xlsx')

Note that in the actual use case, the index needs to be sorted on level 0.


1 Answer
User
#1 · Posted 2024-04-25 05:03:17

This should work:

import pandas as pd
import numpy as np


dict_map = dict(group_one=["D", "GG", "G"], 
                group_two=["A", "C", "E", "F"])
vv = np.random.randn(5, 4)
nn = np.array(
    [
        ["foo", "*", "bar", "ff"],
        ["foo", "*", "bar", "**"],
        ["foo", "*", "bar", "**"],
        ["foo", "*", "bar", "ff"],
        ["foo", "*", "**", "ff"],
    ]
)
arrays = [["bar", "bar", "baz", "baz"], ["one", "two", "one", "two"]]
tuples = list(zip(*arrays))

index = pd.MultiIndex.from_tuples(tuples, names=["first", "second"])
df = pd.DataFrame(nn, index=["A", "B", "C", "D", "G"], columns=index)
df = df.rename_axis(index=["my_ch"]).reset_index()

d = {i: k for k, v in dict_map.items() for i in v}
out = df.assign(Group=df.xs("my_ch", axis=1).map(d).fillna("Other"))


def highlight_(s):
    return np.select(
        # Raw strings keep the regex escapes from being read as invalid Python escapes.
        condlist=[s.str.contains(r"\*\*"), s.str.contains(r"\*")],
        choicelist=["background-color:green", "background-color:purple"],
        default=None,
    )


(
    out.set_index(["Group", "my_ch"])
    .sort_index(level=0)  # "Group" is index level 0 only after set_index
    .style.apply(highlight_)
    .to_excel("n1test.xlsx")
)

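The main difference is that all sorting and index setting is done on the plain DataFrame before the Styler is created; only then is the style applied and the result saved to an Excel file. In the question, the Styler is created first and its `data` attribute is replaced afterwards, which appears to leave the Styler's stored labels out of step with the new index and is the likely source of the KeyError. The whole chain is wrapped in parentheses so it can span several lines without backslash continuations.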
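
One detail worth checking, since the question asks for a sort on index level 0: `sort_index(level=0)` sorts whatever index the frame has at that moment. The sketch below uses a hypothetical flat-column miniature of the frame above (the `value` column and the variable names are made up for illustration, not taken from the original code). It suggests that the sort only orders the rows by `Group` when it comes after `set_index`:

```python
import pandas as pd

# Hypothetical miniature of the frame above: flat columns, default RangeIndex.
frame = pd.DataFrame(
    {
        "my_ch": ["A", "B", "C", "D", "G"],
        "Group": ["group_two", "Other", "group_two", "group_one", "group_one"],
        "value": ["ff", "**", "**", "ff", "ff"],
    }
)

# Sorting first only sorts the RangeIndex 0..4, which is already in order,
# so the rows keep their original order after set_index.
sorted_too_early = frame.sort_index(level=0).set_index(["Group", "my_ch"])

# Sorting after set_index sorts on level 0 of the new MultiIndex ("Group"),
# then on the remaining level ("my_ch").
prepared = frame.set_index(["Group", "my_ch"]).sort_index(level=0)

print(sorted_too_early.index.get_level_values("Group").tolist())
# ['group_two', 'Other', 'group_two', 'group_one', 'group_one']
print(prepared.index.get_level_values("Group").tolist())
# ['Other', 'group_one', 'group_one', 'group_two', 'group_two']
```

A frame prepared this way can then be passed on to `.style.apply(highlight_)` and `.to_excel(...)` as in the answer. Note that the sort is case-sensitive, so `Other` comes before the lowercase group names.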
