Indexing with str.contains() and then inserting a value into another column

import pandas as pd
import re

df = pd.DataFrame({
    'id': [1, 2, 3, 4, 5, 6, 7, 8, 9, 10],
    'store': ['McDonalds', 'Lidl', 'Lidl New York 123', 'KFC ', 'Taco Restaurant',
              'Lidl Berlin', 'Popeyes', 'Wallmart', 'Aldi', 'London Lidl'],
})
print(df)
   id              store
0   1          McDonalds
1   2               Lidl
2   3  Lidl New York 123
3   4               KFC 
4   5    Taco Restaurant
5   6        Lidl Berlin
6   7            Popeyes
7   8           Wallmart
8   9               Aldi
9  10        London Lidl

df[df.store.str.contains(r'\blidl\b',re.I,regex=True)]['standard'] = 'Lidl'
print(df)
   id              store standard_name
0   1          McDonalds           NaN
1   2               Lidl           NaN
2   3  Lidl New York 123           NaN
3   4               KFC            NaN
4   5    Taco Restaurant           NaN
5   6        Lidl Berlin           NaN
6   7            Popeyes           NaN
7   8           Wallmart           NaN
8   9               Aldi           NaN
9  10        London Lidl           NaN

Desired output:

   id              store standard_name
0   1          McDonalds           NaN
1   2               Lidl          Lidl
2   3  Lidl New York 123          Lidl
3   4               KFC            NaN
4   5    Taco Restaurant           NaN
5   6        Lidl Berlin          Lidl
6   7            Popeyes           NaN
7   8           Wallmart           NaN
8   9               Aldi           NaN
9  10        London Lidl          Lidl

1 answer

User

#1 · Posted 2024-06-06 10:35:36

If you want to set a new column, you can use DataFrame.loc together with case=False or re.I:

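Note: d['standard_name'] = pd.np.nan is not necessary and can be omitted. Assigning through .loc creates the column and leaves the non-matching rows as NaN. (pd.np was also removed in pandas 2.0.)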

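# The boolean mask selects the matching rows; .loc writes 'Lidl' into 'standard' for them
df.loc[df.store.str.contains(r'\blidl\b', case=False), 'standard'] = 'Lidl'
# Alternative: pass the regex flag instead of case=False
# df.loc[df.store.str.contains(r'\blidl\b', flags=re.I), 'standard'] = 'Lidl'
print(df)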
   id              store standard
0   1          McDonalds      NaN
1   2               Lidl     Lidl
2   3  Lidl New York 123     Lidl
3   4               KFC       NaN
4   5    Taco Restaurant      NaN
5   6        Lidl Berlin     Lidl
6   7            Popeyes      NaN
7   8           Wallmart      NaN
8   9               Aldi      NaN
9  10        London Lidl     Lidl
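
The same .loc pattern can be repeated for several store chains. The following is only a sketch under assumptions: the `patterns` mapping, the shortened sample data and the extra `na=False` argument are illustrative and not part of the original question.

```python
import pandas as pd

# Shortened sample data (hypothetical), including a missing store name
df = pd.DataFrame({
    "id": [1, 2, 3, 4, 5],
    "store": ["McDonalds", "Lidl New York 123", "ALDI Nord", None, "London Lidl"],
})

# Hypothetical mapping: standard name -> regex that matches its variants
patterns = {
    "Lidl": r"\blidl\b",
    "Aldi": r"\baldi\b",
}

for standard_name, pattern in patterns.items():
    # na=False treats missing store names as "no match", so the mask is purely boolean
    mask = df["store"].str.contains(pattern, case=False, na=False)
    # The first assignment creates the column; rows never matched stay NaN
    df.loc[mask, "standard"] = standard_name

print(df)
```

If two patterns match the same row, the later entry in the mapping overwrites the earlier one, so order the mapping accordingly.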

Alternatively, you can use another approach, Series.str.extract:

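# expand=False returns a Series, which can be assigned directly as a column
df['standard'] = df['store'].str.extract(r'(?i)(\blidl\b)', expand=False)
# Alternative: pass the regex flag instead of the inline (?i)
# df['standard'] = df['store'].str.extract(r'(\blidl\b)', flags=re.I, expand=False)
print(df)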
   id              store standard
0   1          McDonalds      NaN
1   2               Lidl     Lidl
2   3  Lidl New York 123     Lidl
3   4               KFC       NaN
4   5    Taco Restaurant      NaN
5   6        Lidl Berlin     Lidl
6   7            Popeyes      NaN
7   8           Wallmart      NaN
8   9               Aldi      NaN
9  10        London Lidl     Lidl
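
One difference between the two approaches is worth keeping in mind: str.extract returns the matched text exactly as it appears in each value, whereas the .loc assignment writes whatever spelling you choose. A small sketch, assuming made-up store names with mixed casing (they are not in the original data):

```python
import re
import pandas as pd

# Hypothetical store names with different casing
stores = pd.Series(["Lidl Berlin", "LIDL Paris", "lidl", "Aldi"])

# extract keeps the casing found in the text; non-matching rows become NaN
extracted = stores.str.extract(r"(\blidl\b)", flags=re.I, expand=False)

# Replace every non-missing match with one canonical spelling
canonical = extracted.mask(extracted.notna(), "Lidl")

print(pd.DataFrame({"store": stores, "extracted": extracted, "canonical": canonical}))
```

With the question's data both approaches give the same result, because every match there is already spelled "Lidl".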
