在Regex中查找“|”之间的句子

网友

1楼 · 编辑于 2024-06-01 03:18:07

正则表达式仅在处理复杂字符串时才是必需的。像这样的简单字符串只能使用字符串函数处理：

a = "[\"{somethingsomething|title=hello there!\n|subtitle=how are you\n|subsubtitle=I'm good, thanks\n}\"]"
b = a.lstrip('["{')
c = b.rstrip('}"]')
c.split('|')
# ['somethingsomething',
# 'title=hello there!\n',
# 'subtitle=how are you\n',
# "subsubtitle=I'm good, thanks\n"]

网友

2楼 · 编辑于 2024-06-01 03:18:07

如果您真的必须为此使用正则表达式，请不要用不必要的lookback和lookahead使它们过于复杂。这些位是您试图匹配的模式的一部分，只需这样使用它们：

title=(.*?)[|]subtitle=(.*?)[|]subsubtitle=(.*?)}

Regular expression visualization

Debuggex Demo

注意，我还在前缀中包含了|，因为否则|字符将作为每个组的一部分结束。我把你们每个贪婪的.*组变成了一个非贪婪的.*?。如果要匹配所有的组，这实际上是没有必要的，但是在您的原始示例中，这就是标题最终包含到sub为止的所有内容，并且子标题最终作为副标题的原因。最后，我把}放在末尾，这样就不会把整个外部分组作为子标题的一部分。你知道吗

网友

3楼 · 编辑于 2024-06-01 03:18:07

可以使用split()方法：

In [5]: data = "{somethingsomething|title=hello there!\n|subtitle=how are you\n|subsubtitle=I'm good, thanks\n}"[1:-1]
In [6]: data
Out[6]: "somethingsomething|title=hello there!\n|subtitle=how are you\n|subsubtitle=I'm good, thanks\n"
In [7]: data = data.replace("\n", "")
In [8]: data
Out[8]: "somethingsomething|title=hello there!|subtitle=how are you|subsubtitle=I'm good, thanks"
In [9]: words = data.split("|")
In [10]: words
Out[10]: 
['somethingsomething',
 'title=hello there!',
 'subtitle=how are you',
 "subsubtitle=I'm good, thanks"]
In [11]: title = words[1].split("=")[1]
In [12]: title
Out[12]: 'hello there!'
In [13]: suttitle =  words[2].split("=")[1]
In [14]: suttitle
Out[14]: 'how are you'
In [15]: subsuttitle = words[3].split("=")[1]
In [16]: subsuttitle
Out[16]: "I'm good, thanks"

相关问题更多 >

编程相关推荐

热门问题

热门文章

在Regex中查找“|”之间的句子

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >