连接表中列的值

Zone Store Department TTLSales 0 APV 220 1 APV ST12 100 2 APV ST12 Elec 40 3 APV ST12 Grocery 20 4 APV ST12 CPG 40

Zone Store Department TTLSales id 0 APV 220 APV 1 APV ST12 100 APV.ST12 2 APV ST12 Elec 40 APV.ST12.Elec 3 APV ST12 Grocery 20 APV.ST12.Grocery 4 APV ST12 CPG 40 APV.ST12.CPG

3条回答

网友

1楼 · 编辑于 2024-06-06 08:39:47

尝试：

#Firstly fill NaN's of the columns:
df[['Zone','Store','Department']]=df[['Zone','Store','Department']].fillna('')
#Finally:
df['id']=(df['Zone']+'.'+df['Store']+'.'+df['Department']).str.rstrip('.')

或

如果有超过4列，则使用apply()（从性能角度来看，第一种方法比应用方法快）：

#Firstly fill NaN's of the columns:
df[['Zone','Store','Department']]=df[['Zone','Store','Department']].fillna('')
#Finally:
df['id'] = df[['Zone','Store','Department']].apply('.'.join, axis=1).str.rstrip('.')

网友

2楼 · 编辑于 2024-06-06 08:39:47

可能工作过度，但这里有另一种使用reduce解决此问题的方法：

from functools import reduce

cols = ['Zone','Store','Department']
f = lambda x,y : (x +'.'+y).str.rstrip(".")
#or# f = lambda x,y : x.str.cat(y,sep='.').str.rstrip(".")

df['id'] = reduce(f,map(df.fillna('').get, cols))

print(df)

  Zone Store Department  TTLSales                id
0  APV   NaN        NaN       220               APV
1  APV  ST12        NaN       100          APV.ST12
2  APV  ST12       Elec        40     APV.ST12.Elec
3  APV  ST12    Grocery        20  APV.ST12.Grocery
4  APV  ST12        CPG        40      APV.ST12.CPG

网友

3楼 · 编辑于 2024-06-06 08:39:47

您可以在此处将df.agg与str.join一起使用

df = df.fillna('')
df['id'] = df[['Zone','Store','Department']].agg('.'.join, axis=1)

相关问题更多 >

编程相关推荐

热门问题

热门文章

连接表中列的值

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >