如何检查没有紧跟关键字的单词，以及没有被关键字包围的单词？

网友

1楼 · 编辑于 2024-06-08 14:26:22

使用正则表达式：

import re
m = re.sub(r'\b(\w+)\b the', 'the', 'the part of the fair that attracts the most people is the fireworks')
print([word for word in m.split(' ') if not word.isspace() and word])

输出：

['the', 'part', 'the', 'fair', 'that', 'the', 'most', 'people', 'the', 'fireworks']

网友

2楼 · 编辑于 2024-06-08 14:26:22

I am trying to look for words that do not immediately come before 'the' .

请注意，下面的代码不使用re

words = 'the part of the fair that attracts the most people is the fireworks'
words_list = words.split()
words_not_before_the = []
for idx, w in enumerate(words_list):
    if idx < len(words_list)-1 and words_list[idx + 1] != 'the':
        words_not_before_the.append(w)
words_not_before_the.append(words_list[-1])
print(words_not_before_the)

输出

['the', 'part', 'the', 'fair', 'that', 'the', 'most', 'people', 'the', 'fireworks']

网友

3楼 · 编辑于 2024-06-08 14:26:22

I am trying to look for words that do not immediately come before the.

试试这个：

import re

# The capture group (\w+) matches a word, that is followed by a word, followed by the word: "the"
p = re.compile(r'(\w+)\W\w+\Wthe')
m = p.findall('the part of the fair that attracts the most people is the fireworks')
print(m)

输出：

['part', 'that', 'people']

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何检查没有紧跟关键字的单词，以及没有被关键字包围的单词？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >