A clean way to create a column based on multiple conditions

+----------+-------+------------+--------+
| industry | class | occupation | value  |
+----------+-------+------------+--------+
| 170      | 4     | 1000       | 123.3  |
| 180      | 7     | 3600       | 4543.8 |
| 570      | 5     | 990        | 657.4  |
+----------+-------+------------+--------+

+----------+-------+------------+--------+------+
| industry | class | occupation | value  | type |
+----------+-------+------------+--------+------+
| 170      | 4     | 1000       | 123.3  | IOP  |
| 180      | 7     | 3600       | 4543.8 | QWE  |
| 570      | 5     | 990        | 657.4  | JKL  |
+----------+-------+------------+--------+------+

1 Answer

User

#1 · Posted on 2024-04-25 16:49:34

Set up the conditions and their outputs, and store each in a list:

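import numpy as np  # needed for np.select below

# Conditions that depend on class alone
a = df['class'].eq(7)
b = df['class'].eq(8)
c = df['class'].isin([1, 2])

# Shared part of d and e: class 4-6 and industry in 170-490 or 570-690
helper = df['class'].isin([4, 5, 6]) & (df.industry.isin(range(170, 491)) | df.industry.isin(range(570, 691)))
d = helper & df.occupation.ge(1000)
e = helper & df.occupation.isin(range(10, 3541))

# conds and outs are matched by position
conds = [a, b, c, d, e]
outs = ['QWE', 'ASD', 'ZXC', 'IOP', 'JKL']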
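As a side note, the `isin(range(...))` checks can also be written with `Series.between`, which is inclusive on both ends by default. The sketch below compares the two on a made-up series of boundary values (the values are hypothetical, chosen only to hit the edges of each range). It assumes the column holds integers; with non-integer values the two forms would differ, since `range` only contains whole numbers.

```python
import pandas as pd

# Hypothetical boundary values around the two industry ranges
industry = pd.Series([169, 170, 490, 491, 569, 570, 690, 691])

# Form used in the answer: membership in half-open integer ranges
in_range_isin = industry.isin(range(170, 491)) | industry.isin(range(570, 691))

# Alternative: closed intervals, so the upper bounds become 490 and 690
in_range_between = industry.between(170, 490) | industry.between(570, 690)

# True when both masks agree element by element
same = in_range_isin.equals(in_range_between)
print(same)
```

Which form to use is mostly a matter of taste. `between` avoids building the range and states the bounds directly, while `isin(range(...))` rejects non-integer values.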

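Use np.select. Note that your conditions overlap: a row can satisfy both d and e (for example, when occupation is 1000), so IOP and JKL are ambiguous. np.select returns the output for the first condition that matches, so with the order above such rows get IOP.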

df['out'] = np.select(conds, outs, default='BNM')

   industry  class  occupation   value  out
0       170      4        1000   123.3  IOP
1       180      7        3600  4543.8  QWE
2       570      5         990   657.4  JKL
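To see how the overlap between IOP and JKL plays out, here is a minimal self-contained sketch. It rebuilds the sample frame from the question, flags the rows where both `d` and `e` hold, and runs `np.select` with the two conditions in either order. The names `overlap`, `d_first` and `e_first` are only for illustration and are not part of the answer above.

```python
import numpy as np
import pandas as pd

# Sample data from the question
df = pd.DataFrame({
    "industry": [170, 180, 570],
    "class": [4, 7, 5],
    "occupation": [1000, 3600, 990],
    "value": [123.3, 4543.8, 657.4],
})

helper = df["class"].isin([4, 5, 6]) & (
    df["industry"].isin(range(170, 491)) | df["industry"].isin(range(570, 691))
)
d = helper & df["occupation"].ge(1000)
e = helper & df["occupation"].isin(range(10, 3541))

# Rows that satisfy both conditions, where the result depends on list order
overlap = d & e

# np.select takes the choice of the first condition that is True
d_first = np.select([d, e], ["IOP", "JKL"], default="BNM")
e_first = np.select([e, d], ["JKL", "IOP"], default="BNM")

print(overlap.tolist())
print(d_first.tolist())
print(e_first.tolist())
```

If the overlap is not intended, one option is to make the conditions mutually exclusive, for instance by narrowing `e` to `e & ~d`, so the result no longer depends on the order of the list.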
