Compare two DataFrame columns to get the match percentage

User

Reply 1 · edited 2024-05-29 02:02:32

Try the isin method of a pandas DataFrame. Assuming df is the first DataFrame and words is a list:

In[1]: (df.isin(words).sum()/df.shape[0])*100
Out[1]:
cars     100.0
bikes     20.0
dtype: float64

You may want to lowercase the strings in both df and the words list to avoid case mismatches.
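A minimal sketch of that lowercasing step, assuming the sample data below (the column contents and the names `df_lower` and `words_lower` are illustrative, not from the original post):

```python
import numpy as np
import pandas as pd

# Sample data, assumed for illustration only
df = pd.DataFrame({
    'cars': ['swift', 'maruti', 'waganor', 'hyundai', 'jeep'],
    'bikes': ['RE', 'Ninja', 'Bajaj', 'pulsar', np.nan],
})
words = ['swift', 'RE', 'maruti', 'waganor', 'hyundai', 'jeep', 'bajaj']

# Lowercase both sides so that 'Bajaj' and 'bajaj' count as a match
words_lower = [w.lower() for w in words]
df_lower = df.apply(lambda col: col.str.lower())

# Share of rows in each column that appear in the word list, in percent
match_pct = df_lower.isin(words_lower).mean() * 100
print(match_pct)
```

With this data, lowercasing lets both 'RE' and 'Bajaj' match, so the bikes column counts two hits out of five rows instead of one. The missing value stays NaN and counts as a non-match.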

User

Reply 2 · edited 2024-05-29 02:02:32

Build a Series using np.in1d and mean, then call its max and idxmax methods:

# Setup
import numpy as np
import pandas as pd

df1 = pd.DataFrame({'cars': {0: 'swift', 1: 'maruti', 2: 'waganor', 3: 'hyundai', 4: 'jeep'}, 'bikes': {0: 'RE', 1: 'Ninja', 2: 'Bajaj', 3: 'pulsar', 4: np.nan}})
df2 = pd.DataFrame({'words': {0: 'swift', 1: 'RE', 2: 'maruti', 3: 'waganor', 4: 'hyundai', 5: 'jeep', 6: 'bajaj'}})

match_rates = pd.Series({col: np.in1d(df1[col], df2['words']).mean() for col in df1})

print('{:.0%} match header - {}'.format(match_rates.max(), match_rates.idxmax()))

[Out]

100% match header - cars
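Recent NumPy releases steer users from np.in1d toward np.isin, so a pandas-only variant may be worth having. The following is a sketch under that assumption, using the same sample data; it swaps in Series.isin and is not the original poster's code:

```python
import numpy as np
import pandas as pd

df1 = pd.DataFrame({
    'cars': ['swift', 'maruti', 'waganor', 'hyundai', 'jeep'],
    'bikes': ['RE', 'Ninja', 'Bajaj', 'pulsar', np.nan],
})
df2 = pd.DataFrame({
    'words': ['swift', 'RE', 'maruti', 'waganor', 'hyundai', 'jeep', 'bajaj'],
})

# Fraction of each df1 column's values that occur in df2['words']
match_rates = df1.apply(lambda col: col.isin(df2['words']).mean())

# Column with the highest match rate
best = match_rates.idxmax()
print('{:.0%} match header - {}'.format(match_rates[best], best))
```

The comparison is still case-sensitive, so 'Bajaj' does not match 'bajaj' here.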

User

Reply 3 · edited 2024-05-29 02:02:32

You can first convert the columns to lists:

dfCarsList = df['cars'].tolist()
dfWordsList = df['words'].tolist()
dfBikesList = df['bikes'].tolist()  # column name is lowercase 'bikes'

Then iterate over the lists to compare:

^{pr2}$
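The loop from this reply did not survive on the page. As a rough sketch of what such an iteration could look like, assuming plain Python lists and a hypothetical helper named `match_percent` (not from the original post):

```python
def match_percent(values, words):
    """Return the percentage of entries in `values` that also appear in `words`."""
    if not values:
        return 0.0
    word_set = set(words)  # set lookup keeps each membership test cheap
    hits = 0
    for value in values:
        if value in word_set:
            hits += 1
    return 100.0 * hits / len(values)


# Sample lists, assumed for illustration only
dfCarsList = ['swift', 'maruti', 'waganor', 'hyundai', 'jeep']
dfBikesList = ['RE', 'Ninja', 'Bajaj', 'pulsar']
dfWordsList = ['swift', 'RE', 'maruti', 'waganor', 'hyundai', 'jeep', 'bajaj']

cars_pct = match_percent(dfCarsList, dfWordsList)
bikes_pct = match_percent(dfBikesList, dfWordsList)
print(cars_pct, bikes_pct)
```

Each call yields one percentage per column, which can then be compared directly.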

You can then use whichever of the resulting numbers is higher.
