DataFrame column based on 4 conditions, nested np.where

| Group | Before | After |
|:-----:|:----------:|:----------:|
| G1 | Injection | Injection |
| G1 | Injection | Production |
| G1 | Production | Injection |
| G1 | Production | Production |

2 answers

User

#1 · edited 2024-05-29 03:08:17

Before["Injection"] does not do what you think it does. In the code you showed, it is not even defined.

What you probably want is:

import pandas as pd

# df definition, skipping Group because it is not needed here
df = pd.DataFrame(data={"Before": ["Injection", "Injection", "Production", "Production"], "After": ["Injection", "Production", "Injection", "Production"]})

df["Output"] = "DTI"  # Use one of the cases as default
df.loc[(df["Before"] == "Injection") & (df["After"] == "Production"), "Output"] = "DTWF + DTP"
df.loc[(df["Before"] == "Production") & (df["After"] == "Injection"), "Output"] = "DTWF + DTI"
df.loc[(df["Before"] == "Production") & (df["After"] == "Production"), "Output"] = "DTP"
print(df)
#        Before       After      Output
# 0   Injection   Injection         DTI
# 1   Injection  Production  DTWF + DTP
# 2  Production   Injection  DTWF + DTI
# 3  Production  Production         DTP

If you have many such combinations, using apply, as suggested in the other answer, may be more appropriate.

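If you have many rows, it is worth saving the boolean indexes (e.g. df["Before"] == "Production") in variables such as before_prod and after_prod, so that each comparison is computed only once, and then using those variables directly in .loc.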

If you also have only these two states, you get the second state for free by using the unary negation operator ~:

df.loc[before_prod & ~after_prod, "Output"] = "DTWF + DTI"
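Put together, the saved-mask idea might look like the sketch below. The names before_prod and after_prod are taken from the line above; their definitions here are an assumption about what the answer intended, and the sketch relies on "Injection" and "Production" being the only two states.

```python
import pandas as pd

df = pd.DataFrame({
    "Before": ["Injection", "Injection", "Production", "Production"],
    "After": ["Injection", "Production", "Injection", "Production"],
})

# Compute each comparison once and reuse the resulting boolean Series.
before_prod = df["Before"] == "Production"
after_prod = df["After"] == "Production"

# Assumption: only two states exist, so ~mask stands for "Injection".
df["Output"] = "DTI"  # default covers Injection -> Injection
df.loc[~before_prod & after_prod, "Output"] = "DTWF + DTP"
df.loc[before_prod & ~after_prod, "Output"] = "DTWF + DTI"
df.loc[before_prod & after_prod, "Output"] = "DTP"
```

If a third state ever appears, the negated masks would silently lump it together with "Injection", so explicit comparisons would be safer in that case.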

User

#2 · edited 2024-05-29 03:08:17

One way is to use the apply function.

Assuming your DataFrame is in the variable df, you can do the following:

import pandas as pd

df = pd.DataFrame(data={"Before": ["Injection", "Injection", "Production", "Production"],
                        "After": ["Injection", "Production", "Injection", "Production"]})
def get_output(x):
    if x['Before'] == 'Injection' and x['After'] == 'Injection':
        return 'DTI'
    elif x['Before'] == 'Injection' and x['After'] == 'Production':
        return 'DTWF + DTP'
    elif x['Before'] == 'Production' and x['After'] == 'Injection':
        return 'DTWF + DTI'
    elif x['Before'] == 'Production' and x['After'] == 'Production':
        return 'DTP'

df['Output'] = df.apply(get_output, axis=1)
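For comparison with the row-wise apply above, and closer to the nested np.where in the question title, here is a minimal vectorized sketch. It is not part of either answer; the mask names before_prod and after_prod are illustrative, and it assumes the two columns only ever contain "Injection" or "Production".

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "Before": ["Injection", "Injection", "Production", "Production"],
    "After": ["Injection", "Production", "Injection", "Production"],
})

before_prod = df["Before"] == "Production"
after_prod = df["After"] == "Production"

# Outer np.where branches on Before, each inner np.where branches on After.
# Assumption: any value that is not "Production" is treated as "Injection".
df["Output"] = np.where(
    before_prod,
    np.where(after_prod, "DTP", "DTWF + DTI"),
    np.where(after_prod, "DTWF + DTP", "DTI"),
)
```

This avoids calling a Python function once per row, which tends to matter on large frames, at the cost of being harder to read once there are more than a few combinations.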
