在两个点之间提取文本，由一个麻木的

网友

1楼 · 编辑于 2024-04-24 11:06:30

你可以用

\d+\.[^.]+\.

x = '1. Some Header. and some more text 2. Another Header. And that is the end'
import re
re.findall((r'\d+\.[^.]\.'),x)

^{}

网友

2楼 · 编辑于 2024-04-24 11:06:30

您可以使用re.findall：

import re
x = '1. Some Header. and some more text 2. Another Header. And that is the end'
result = re.findall('\d+\.\s+[\w\s]+(?=[\.$])', x)

输出：

['1. Some Header', '2. Another Header']

网友

3楼 · 编辑于 2024-04-24 11:06:30

如果标头应该从1开始，则可以使用捕获组：

(?<!\S)([1-9][0-9]*\.[^.]+)\.

Regex demo

你还能用吗

(?<!\S)(\d+\.[^.]+)\.

解释

(?<!\S)断言直接在左边的不是非空格字符
(捕获组1
- \d+\.[^.]+匹配1+个数字、点和除点以外的任何字符的1+倍
)\.关闭组1并匹配一个点

Regex demo| Python demo

例如使用关于芬德尔你知道吗

import re 

regex = r"(?<!\S)(\d+\.[^.]+)\." 
test_str = "1. Some Header. and some more text 2. Another Header. And that is the end"

print(re.findall(regex, test_str))

结果

['1. Some Header', '2. Another Header']

相关问题更多 >

编程相关推荐

热门问题

热门文章

在两个点之间提取文本，由一个麻木的

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >