正则表达式查找和替换多重

网友

1楼 · 编辑于 2024-04-19 08:09:49

使用ungreedy匹配.*?<；~~在+或*之后的?使其匹配尽可能少的字符。默认设置为贪婪，并尽可能多地使用字符。你知道吗

p = re.compile("\[\[(.*?)\]\]")

网友

2楼 · 编辑于 2024-04-19 08:09:49

您可以使用：

p = re.compile(r"\[\[[^\]]+\]\]")

>>> data = "this is my new string it contains [[hello]] and [[bye]] and nothing else"
>>> p = re.compile(r"\[\[[^\]]+\]\]")
>>> data = p.sub('STAR', data)
>>> data
'this is my new string it contains STAR and STAR and nothing else'

网友

3楼 · 编辑于 2024-04-19 08:09:49

.*是贪婪的，它可以匹配尽可能多的文本，包括]]和[[，因此它可以通过“标记”边界继续前进。你知道吗

一个快速的解决方案是通过添加一个?：

p = re.compile(r"\[\[(.*?)\]\]")

一个更好的解决方案（更健壮、更明确，但速度稍慢）是明确指出我们不能跨越标记边界进行匹配：

p = re.compile(r"\[\[((?:(?!\]\]).)*)\]\]")

说明：

\[\[        # Match [[
(           # Match and capture...
 (?:        # ...the following regex:
  (?!\]\])  # (only if we're not at the start of the sequence ]]
  .         # any character
 )*         # Repeat any number of times
)           # End of capturing group
\]\]        # Match ]]

相关问题更多 >

编程相关推荐

热门问题

热门文章

正则表达式查找和替换多重

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >