Pandas explode function not working properly

              title price weight
0   Crloni Model145   $45  200gm
1   Crloni Model145   $45  500gm
2   Crloni Model145   $45  800gm
3   Crloni Model145   $50  200gm
4   Crloni Model145   $50  500gm
5   Crloni Model145   $50  800gm
6   Crloni Model145   $45  200gm
7   Crloni Model145   $45  500gm
8   Crloni Model145   $45  800gm
9   Crloni Model145   $60  200gm
10  Crloni Model145   $60  500gm
11  Crloni Model145   $60  800gm

data['price'] = data['price'].str.split(',')
data['weight'] = data['weight'].str.split(',')
out = data.explode(['price','weight'])
data['description'] = data['description'].mask(data['description'].shift() == data['description'])

category  title      price     weight             description
Shirt     men-shirt  20,25,35  100gm,50gm,150gm   shirt description....
pant      men-pent   40,35,90  200gm,350gm,150gm  pant description....

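2 Answers

User

#1 · Edited 2024-05-16 06:23:26

If you are on a pandas version older than 1.3.0 (the release that added multi-column explode):

Because the lists produced by splitting the strings have the same number of elements in each row, you can apply Series.explode to the price and weight columns to get the expected output.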

import pandas as pd

df = pd.DataFrame({'title': ['Crloni Model145'],
                   'price': ['$45,$50,$60'],
                   'weight': ['200gm,500gm,800gm']})

df['price']=df['price'].str.split(',')
df['weight']=df['weight'].str.split(',')

df = df.set_index(['title']).apply(pd.Series.explode).reset_index()

print(df)

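I set title as the index because I don't want explode applied to that column, then reset the index at the end so that title becomes a regular column again.

Output: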

             title price weight
0  Crloni Model145   $45  200gm
1  Crloni Model145   $50  500gm
2  Crloni Model145   $60  800gm
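The same index trick seems to extend to the table in the question, which has more than one column that should be left alone. The following is only a sketch under assumptions: the sample rows are modeled on the question's input, and the names list_cols and key_cols are made up for illustration. Every column that is not split into lists is moved into the index before Series.explode is applied.

```python
import pandas as pd

# Sample rows modeled on the input table shown in the question.
df = pd.DataFrame({
    "category": ["Shirt", "pant"],
    "title": ["men-shirt", "men-pent"],
    "price": ["20,25,35", "40,35,90"],
    "weight": ["100gm,50gm,150gm", "200gm,350gm,150gm"],
    "description": ["shirt description....", "pant description...."],
})

# Hypothetical helper names: the columns holding comma-separated values.
list_cols = ["price", "weight"]
for col in list_cols:
    df[col] = df[col].str.split(",")

# Everything else goes into the index so explode never touches it.
key_cols = [c for c in df.columns if c not in list_cols]

# Explode each list column separately; this relies on every row having
# lists of equal length in all list columns.
out = df.set_index(key_cols).apply(pd.Series.explode).reset_index()

print(out)
```

Note that reset_index puts the former index columns first, so description ends up before price and weight; reorder the columns afterwards if the original order matters.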

User

#2 · Edited 2024-05-16 06:23:26

Update pandas. Since version 1.3.0, explode accepts multiple columns:

df['price'] = df['price'].str.split(',')
df['weight'] = df['weight'].str.split(',')
out = df.explode(['price','weight'])
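For reference, here is a self-contained version of the same idea that can be run as is. It is a minimal sketch, assuming pandas 1.3.0 or newer and sample rows modeled on the question's input; the length check and ignore_index=True are additions for illustration, not part of the answer above.

```python
import pandas as pd

# Sample rows modeled on the input table shown in the question.
df = pd.DataFrame({
    "category": ["Shirt", "pant"],
    "title": ["men-shirt", "men-pent"],
    "price": ["20,25,35", "40,35,90"],
    "weight": ["100gm,50gm,150gm", "200gm,350gm,150gm"],
})

# Turn the comma-separated strings into lists.
df["price"] = df["price"].str.split(",")
df["weight"] = df["weight"].str.split(",")

# Multi-column explode needs matching list lengths within each row,
# so check that before exploding (.str.len() also works on lists).
same_length = (df["price"].str.len() == df["weight"].str.len()).all()
if not same_length:
    raise ValueError("price and weight lists differ in length")

# Passing a list of columns requires pandas >= 1.3.0.
# ignore_index=True replaces the repeated row labels with 0..n-1.
out = df.explode(["price", "weight"], ignore_index=True)

print(out)
```

Without ignore_index=True the exploded rows keep the label of the row they came from, so the index would read 0, 0, 0, 1, 1, 1.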
