使用findall捕获分组？

Question

如果我使用 findall(r'regex(with)capturing.goes.here')，我该如何访问捕获的组呢？我知道可以通过 finditer 来做到这一点，但我不想一个一个地遍历。

Answer 1

import re
string = 'Perotto, Pier Giorgio'
names = re.findall(r'''
                 (?P<first>[-\w ]+),\s #first name
                 (?P<last> [-\w ]+) #last name
                 ''',string, re.X|re.M)

print(names)

返回值

[('Perotto', 'Pier Giorgio')]

re.M 这个选项在你的字符串有多行的时候才有意义。另外，我写的正则表达式需要 VERBOSE 模式（也就是 re.X），因为它使用了 ''' 这种写法。

Answer 2

可以随意使用分组。匹配的结果会以一系列的组元组返回：

>>> re.findall('(1(23))45', '12345')
[('123', '23')]

如果你想把完整的匹配结果也包含在内，只需把整个正则表达式放在一个括号里：

>>> re.findall('(1(23)45)', '12345')
[('12345', '23')]

Answer 3

findall 这个函数的作用就是返回你找到的所有匹配部分：

>>> re.findall('abc(de)fg(123)', 'abcdefg123 and again abcdefg123')
[('de', '123'), ('de', '123')]

使用findall捕获分组？

4 个回答

撰写回答