使用Python查找包含关键字数组之一的句子

网友

1楼 · 编辑于 2024-05-13 06:03:51

为什么不使用列表理解？在

print [sent for sent in text.split('.') 
        if any(word in sent for word in define_words.split()) ]

或者，如果更改字符串列表的define_words：

^{pr2}$

网友

2楼 · 编辑于 2024-05-13 06:03:51

我不能评论（我没有足够的声誉），所以这个答案在技术上不是一个答案。在

我不太熟悉regex，但是假设您的re.findall()成功，您可以使用以下代码：

import re, itertools
from collections import Counter
f = open('C:\\Python27\\test\\A.txt')

text = f.read()
everything = []
define_words = ['contractual', 'obligation', 'law', 'employer']
for k in define_words:
    everything.append(re.findall(r"([^.]*?%s[^.]*\.)" % k,text))

everything = list(itertools.chain(*everything))
counts = Counter(everything)
everything = [value for value, count in counts.items() if count > 1]
everything = list(itertools.chain(*everything))
print everything

这将循环遍历数组列表并将值添加到列表中，从而生成列表列表。然后我只保留重复项（好值），并将列表列表转换为一个列表。在

错误：真正的错误是所有东西都是一个列表列表，Counter(everything)不允许这样做。因此，我在Counter()之前将其剥离。在

网友

3楼 · 编辑于 2024-05-13 06:03:51

def init_contains_useful_word(words_to_search_for):

    def contains_useful_word(sentence):
        return any(map(lambda x: x in sentence, words_to_search_for))

with open(filename, 'r') as f:
    text = f.read()

sentences = text.split(".")

for words in list_of_lists:
    contains_useful_word = init_contains_useful_word(words)

    sentences = filter(contains_useful_word, sentences)

with open(filename, 'w') as f:
    f.write(sentences.join(" "))

实际上，如果你愿意，你可以用你的重运算符替换包含有用的单词。在

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用Python查找包含关键字数组之一的句子

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >