2024-06-11 17:15:58 发布
网友
我有一个数据帧需要在uid\u has\u new和uid\u hash上创建模糊比率/hamming距离并创建一个新表
uid uid_hash uid_hash_new 1 123 ABC28071 ABC28079 4 121 ABC28071 ABC28089
伪代码
import pandas as pd def ratio(x,y): return df['ratio'] = df['uid_hash_new','uid_hash'].apply(ratio)
你可以做:
def ratio(a): return a[0]+a[1] df['ratio'] = df['uid_hash_new','uid_hash'].apply(ratio, axis=1)
所以a[0]将是uid_hash_new(x)和a[1]:uid_hash(y)
a[0]
uid_hash_new
a[1]
uid_hash
将^{}与lambda函数一起使用:
df['ratio'] = df.apply(lambda x: ratio(x['uid_hash_new'], x['uid_hash']), axis=1)
你可以做:
所以
a[0]
将是uid_hash_new
(x)和a[1]
:uid_hash
(y)将^{} 与lambda函数一起使用:
相关问题 更多 >
编程相关推荐