Caching fnmatch.filter calls

import fnmatch

def badWordMatch(string):
    # Shell-style patterns: "*" matches any run of characters, "?" exactly one.
    bad_words = ["poo", "wee", "barsteward*", "?orrible"]
    data = string.split()
    for each in bad_words:
        matches = fnmatch.filter(data, each)
        if matches:
            # Report the pattern with its wildcards stripped.
            return each.replace("?", "").replace("*", "")
    return None

string_input = "Please do not wee in the swimming pool you 'orrible naughty barstewards!"    # Matched: "wee"
#string_input = "Please do not dive in the swimming pool you 'orrible naughty barstewards!"  # Matched: "barsteward"
#string_input = "Please do not dive in the swimming pool you 'orrible naughty kids!"         # Matched: "orrible"
#string_input = "Please do not dive in the swimming pool you horrible naughty kids!"         # Matched: "orrible"
#string_input = "Please do not dive in the swimming pool you naughty kids!"                  # No match!

isbadword = badWordMatch(string_input)
if isbadword is not None:
    print("Matched: %s" % (isbadword))
else:
    print("No match, string is clean!")

1 answer

User

#1 · Posted 2024-05-26 16:27:58

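In Python 3.2+, the pattern compilation behind fnmatch.filter has an LRU cache decorator, which means the 256 most recently used patterns are cached. fnmatch uses re internally, so your patterns are converted to regexes behind the scenes and are hence cached automatically.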
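To make the caching concrete, here is a minimal sketch of the same idea written with public APIs only. It is not the standard library's actual source: compile_glob and glob_filter are hypothetical names, and unlike fnmatch.filter this version does no os.path.normcase step, so it is always case-sensitive.

```python
import fnmatch
import functools
import re


@functools.lru_cache(maxsize=256)
def compile_glob(pattern):
    """Translate a shell-style pattern to a regex and compile it once."""
    # fnmatch.translate returns the regex source for a glob pattern.
    return re.compile(fnmatch.translate(pattern))


def glob_filter(words, pattern):
    """Return the words that match pattern (hypothetical helper)."""
    match = compile_glob(pattern).match
    return [word for word in words if match(word)]


words = "you 'orrible naughty barstewards!".split()
for _ in range(3):
    # Only the first iteration compiles; the other two hit the cache.
    hits = glob_filter(words, "barsteward*")

print(hits)
print(compile_glob.cache_info())
```

The cache_info() line shows how many lookups were served from the cache, which is the effect the answer describes for repeated calls with the same pattern.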

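It would still be better to build a single regex from the bad-word list, because, according to this answer, one (explicitly compiled) regex is much faster than several hundred (implicitly compiled) regexes used the way the example does.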
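One way the single-regex suggestion might look is sketched below. The helper names (glob_to_regex, bad_word_match) are hypothetical, and the glob conversion only handles "*" and "?", which is an assumption that covers the patterns in the question but not character classes such as "[abc]". Note one behavioural difference: this version scans the text word by word and reports the first offending word, whereas the question's loop reports the first pattern in list order that matches anywhere.

```python
import re

BAD_WORDS = ["poo", "wee", "barsteward*", "?orrible"]


def glob_to_regex(pattern):
    """Convert a glob using only '*' and '?' into regex source."""
    parts = []
    for ch in pattern:
        if ch == "*":
            parts.append(".*")
        elif ch == "?":
            parts.append(".")
        else:
            parts.append(re.escape(ch))
    return "".join(parts)


# One capturing group per bad word, joined into a single alternation
# and compiled exactly once at import time.
BAD_WORD_RE = re.compile("|".join("(%s)" % glob_to_regex(p) for p in BAD_WORDS))


def bad_word_match(text):
    """Return the bad word (wildcards stripped) for the first offending word."""
    for word in text.split():
        m = BAD_WORD_RE.fullmatch(word)
        if m:
            # lastindex is the 1-based number of the group that matched.
            return BAD_WORDS[m.lastindex - 1].replace("?", "").replace("*", "")
    return None


print(bad_word_match("Please do not wee in the swimming pool you 'orrible naughty barstewards!"))
```

Because every pattern sits in its own capturing group and none of the groups are nested, m.lastindex identifies which entry of BAD_WORDS matched without running the patterns one at a time.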
