使用线性搜索的Python拼写检查器

import re import time start_time = time.time() def LinearSearch(Target, Words): #Linear search for target in words. Words need not be sorted. for s in Words: if s==Target: return True return False # Gets the Dictionary. Words = [s.strip("\n").lower() for s in open("10kWords.txt")] # Gets ShakespearesFullWorks and Encodes it. Input_File = open('ShakespeareFullWorks.txt', "r", encoding='utf-8') lines = Input_File.readlines() for x in lines: if not LinearSearch(x, Words): print (re.findall(r"[\w']+", x)) print ("--- %s seconds ---" % (time.time() - start_time))

1条回答

网友

1楼 · 发布于 2024-05-23 17:47:17

问题是LinearSearch(x, Words)中的x不是一个单词，而是一行。所以每一行都是打印出来的，因为一行可能与一个单词不匹配。你需要做：

for line in lines:
    for word in re.findall(r"[\w']+", line):
        if not LinearSearch(word, Words):
            print(word)

假设re.findall(r"[\w']+", x)返回x中的单词列表。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章