使用列表读取Pandas中的列以创建新的分类列

col_1 col2 Spiderman 2 Abe Lincoln 1 Superman 2 Ghandi 3 Jane Austin 4 Robert de Niro 4 Elon Musk 4 George Bush 1 Bill Gates 4 Barak Obama 1 Anne Frank 3

3条回答

网友

1楼 · 编辑于 2024-06-07 11:03:42

重新构造你的dict，并使用^{}和^{}。你知道吗

注意，在这个例子中，我将dict重命名为my_dict。用“dict”作为名字是个坏主意。你知道吗

remapped_dict = {i: k for k, v in my_dict.items() for i in v}

df['col_2'] = df['col_1'].map(remapped_dict).str.extract(r'(\d+)')

[输出]

             col_1 col_2
0        Spiderman     2
1      Abe Lincoln     1
2         Superman     2
3           Ghandi     3
4      Jane Austin   NaN
5   Robert de Niro   NaN
6        Elon Musk   NaN
7      George Bush     1
8       Bill Gates   NaN
9      Barak Obama     1
10      Anne Frank   NaN

网友

2楼 · 编辑于 2024-06-07 11:03:42

展平你的dict，然后我们做map，也不要把你的dict命名为dict

from functools import reduce

yourd = reduce(lambda a, b: dict(a, **b), [dict.fromkeys(y,x) for x , y in d.items()])
df['New']=df.col_1.map(yourd)
df
Out[194]: 
             col_1  New
0        Spiderman   l2
1      Abe Lincoln   l1
2         Superman   l2
3           Ghandi   l3
4      Jane Austin  NaN
5   Robert de Niro  NaN
6        Elon Musk  NaN
7      George Bush   l1
8       Bill Gates  NaN
9      Barak Obama   l1
10      Anne Frank  NaN

网友

3楼 · 编辑于 2024-06-07 11:03:42

我认为您需要按字典循环并将值keys与^{}一起设置，以便在示例数据get NaNs中检查成员身份，因为dict中缺少另一个值：

#not use python reserved word dict for variable name
d = {'l1': l1, 'l2': l2,'l3': l3} 

for k, v in d.items():
    df.loc[df['col_1'].isin(v), 'new'] = k
print (df)
             col_1  new
0        spiderman   l2
1      Abe Lincoln   l1
2         superman   l2
3           Ghandi   l3
4      Jane Austin  NaN
5   Robert de Niro  NaN
6        Elon Musk  NaN
7      George Bush   l1
8       Bill Gates  NaN
9      Barak Obama   l1
10      Anne Frank  NaN

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用列表读取Pandas中的列以创建新的分类列

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >