假设我有一个这样的运动列表:
sports=["futball","fitbal","football","tennis","tenis","tenisse","footbal","zennis","ping-pong"]
如果模糊匹配优于0.5,并且不只是与自身匹配,我想创建一个数据帧,将运动的每个元素与其最接近的元素匹配(我想使用fuzzyfuzzy.fuzz.ratio(x,y)函数)
结果应该如下所示:
pd.DataFrame({"sport":sports,"closest_match":["futball","futball","football","tennis","tennis","tennis","futball","tennis","ping-pong"]})
sport closest_match
0 futball futball
1 fitbal futball
2 football football
3 tennis tennis
4 tenis tennis
5 tenisse tennis
6 footbal futball
7 zennis tennis
8 ping-pong ping-pong
谢谢
下面是一个使用itertools.combinations的解决方案:
输出:
相关问题 更多 >
编程相关推荐