Pandas：通过比较数据帧行与另一个数据帧的列来创建新列

df1= pd.DataFrame({'alligator_apple': range(1, 11), 'barbadine': range(11, 21), 'capulin_cherry': range(21, 31)}) alligator_apple barbadine capulin_cherry 0 1 11 21 1 2 12 22 2 3 13 23 3 4 14 24 4 5 15 25 5 6 16 26 6 7 17 27 7 8 18 28 8 9 19 29 9 10 20 30

df2= pd.DataFrame({'alligator_apple': [6, 7, 15, 5], 'barbadine': [3, 19, 25, 12], 'capulin_cherry': [1, 9, 15, 27]}) alligator_apple barbadine capulin_cherry 0 6 3 1 1 7 19 9 2 15 25 15 3 5 12 27

alligator_apple barbadine capulin_cherry greater 0 6 3 1 4 1 7 19 9 1 2 15 25 15 0 3 5 12 27 3

2条回答

网友

1楼 · 编辑于 2024-05-16 21:46:47

我相信这正是你想要的：

df2['greater'] = df2.apply(
    lambda row: 
    (df1['alligator_apple'] > row['alligator_apple']) & 
    (df1['barbadine'] > row['barbadine']) & 
    (df1['capulin_cherry'] > row['capulin_cherry']), 
    axis=1,
).sum(axis=1)

print(df2)

输出：

   alligator_apple  barbadine  capulin_cherry  greater
0                6          3               1        4
1                7         19               9        1
2               15         25              15        0
3                5         12              27        3

编辑：如果您想对给定的列集概括并应用此逻辑，我们可以将functools.reduce与operator.and_一起使用：

import functools
import operator

columns = ['alligator_apple', 'barbadine', 'capulin_cherry']

df2['greater'] = df2.apply(
    lambda row: functools.reduce(
        operator.and_, 
        (df1[column] > row[column] for column in columns),
    ), 
    axis=1,
).sum(axis=1)

网友

2楼 · 编辑于 2024-05-16 21:46:47

有一个通用的解决方案应该可以很好地解决这个问题

def gt_mask(row,df):
    mask = True
    for key,val in row.items():
        mask &= df[key] > val
    return len(df[mask])

df2['greater'] = df2.apply(gt_mask,df=df1,axis=1)

输出df2

,alligator_apple,barbadine,capulin_cherry,greater
0,6,3,1,4
1,7,19,9,1
2,15,25,15,0
3,5,12,27,3

这将创建一个掩码，遍历给定行的键/值对

编辑这个答案很有帮助：Masking a DataFrame on multiple column conditions - inside a loop

相关问题更多 >

编程相关推荐

热门问题

热门文章