data = {'col1':['Country', 'State', 'City', 'park' ,'avenue'],
'col2':['County','stats','PARK','Avenue', 'cities']}
col1 col2
0 Country County
1 State stats
2 City PARK
3 park Avenue
4 avenue cities
我试着用模糊模糊技术匹配两列的名称,并按分数排序。你知道吗
输出:
col1 col2 score order
0 Country County 92 1
1 Country stats 31 2
2 Country PARK 18 3
3 Country Avenue 17 4
4 Country cities 16 5
5 State County 80 1
6 State stats 36 2
7 State PARK 22 3
8 State Avenue 18 4
9 State cities 16 5
.....
我所做的:
''
from fuzzywuzzy import fuzz
import pandas as pd
import numpy as np
for i in df.col1:
for j in df.col2:
print(i,j,fuzz.token_set_ratio(i, j))
''
我被困在这里了。。你知道吗
让我们做吧
那么
相关问题 更多 >
编程相关推荐