pandas DataFrame: return the first word of the string in a column

df = pd.DataFrame({'id' : ['abarth 1.4 a','abarth 1 a','land rover 1.3 r','land rover 2', 'land rover 5 g','mazda 4.55 bl'], 'series': ['a','a','r','','g', 'bl'] })

3 Answers

User

Answer 1 · Edited 2024-05-16 00:57:25

Use str.split and str.get, and assign with loc only where df.make == '':

df.loc[df.make == '', 'make'] = df.id.str.split().str.get(0)

print(df)

               id    make
0      abarth 1.4  abarth
1        abarth 1  abarth
2  land rover 1.3   rover
3    land rover 2   rover
4    land rover 5   rover
5      mazda 4.55   mazda
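
For readers who want to run the approach above on their own, here is a minimal sketch. The `make` column and its blank entries are assumptions added so the snippet is self-contained; the DataFrame in the question as posted only shows `id` and `series`.

```python
import pandas as pd

# Hypothetical sample: the `make` column and its blank values are assumed,
# they are not part of the DataFrame shown in the question.
df = pd.DataFrame({
    'id': ['abarth 1.4 a', 'land rover 2', 'mazda 4.55 bl'],
    'make': ['abarth', '', ''],
})

# First whitespace-separated token of every id.
first_word = df['id'].str.split().str.get(0)

# Only overwrite the rows whose make is an empty string.
blank = df['make'] == ''
df.loc[blank, 'make'] = first_word[blank]

print(df['make'].tolist())  # ['abarth', 'land', 'mazda']
```

Note that a two-word make such as "land rover" is cut down to "land" here, since only the first token is kept.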

User

Answer 2 · Edited 2024-05-16 00:57:25

Consider a regex solution with loc. Note that `.*` is greedy, so this pattern captures everything before the last space, not the first:

df.loc[df['make']=='', 'make'] = df['id'].str.extract('(.*) ', expand=False)
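
A small sketch of how the pattern choice changes the result. The sample ids are shortened from the question's data, and the second pattern, `^(\S+)`, is an alternative suggested here rather than something from the answer.

```python
import pandas as pd

ids = pd.Series(['abarth 1.4', 'land rover 1.3', 'mazda 4.55'])

# Greedy: `.*` runs as far as it can, so the capture stops at the LAST space.
greedy = ids.str.extract(r'(.*) ', expand=False)

# Alternative (assumption, not from the answer): capture only the first token.
first_token = ids.str.extract(r'^(\S+)', expand=False)

print(greedy.tolist())       # ['abarth', 'land rover', 'mazda']
print(first_token.tolist())  # ['abarth', 'land', 'mazda']
```

The greedy form happens to keep "land rover" intact, which may be what is wanted for makes; the first-token form matches the wording of the question title.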

Alternatively, use numpy's where, which allows if/then/else conditional logic:

df['make'] = np.where(df['make']=='', 
                      df['id'].str.extract('(.*) ', expand=False), 
                      df['make'])
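
A self-contained version of the np.where variant might look like the following. As before, the `make` column with blank entries is an assumed setup, and str.split is used for the replacement values to keep the example short.

```python
import numpy as np
import pandas as pd

# Hypothetical sample: `make` and its blank values are assumed for illustration.
df = pd.DataFrame({
    'id': ['abarth 1.4 a', 'land rover 2', 'mazda 4.55 bl'],
    'make': ['abarth', '', ''],
})

# np.where(condition, value_if_true, value_if_false), evaluated element-wise.
df['make'] = np.where(
    df['make'] == '',
    df['id'].str.split().str.get(0),  # used where make is blank
    df['make'],                       # kept everywhere else
)

print(df['make'].tolist())  # ['abarth', 'land', 'mazda']
```

Unlike the loc assignment, np.where rebuilds the whole column, which is convenient when both branches are computed anyway.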

User

Answer 3 · Edited 2024-05-16 00:57:25

If I understood your question correctly, you can use the replace function:

df.make = df.make.replace("", test.id)
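
The name `test` is not defined anywhere in this thread; it is presumably the poster's own DataFrame. Assuming it is the same frame as `df`, the idea of filling blank `make` values from `id` can be sketched with Series.mask, which is a swapped-in technique and not the replace call from the answer. The sample data is again hypothetical.

```python
import pandas as pd

# Hypothetical sample: `make` and its blank values are assumed for illustration.
df = pd.DataFrame({
    'id': ['abarth 1.4 a', 'land rover 2', 'mazda 4.55 bl'],
    'make': ['abarth', '', ''],
})

# Series.mask replaces values where the condition is True with the
# index-aligned values from `other`.
df['make'] = df['make'].mask(df['make'] == '', df['id'])

print(df['make'].tolist())  # ['abarth', 'land rover 2', 'mazda 4.55 bl']
```

This copies the full id string into the blank cells, so it would still need one of the splitting or regex steps from the other answers to reduce it to the first word.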
