python中的基本正则表达式问题，帮助我学习

import re p = re.compile(r'^(?P<Given>\w+) (?P<Middle>\w\.) (?P<Family>\w+)$', re.MULTILINE) str = "Jack A. Smith\nMary B. Miller" m = p.match(str) print m.group(0) Jack A. Smith print m.group(1) Jack print m.group(2) A. print m.group(3) Smith print m.group(4) Traceback (most recent call last): File "<stdin>", line 1, in <module> IndexError: no such group

3条回答

网友

1楼 · 编辑于 2024-04-18 00:00:05

来自re模块的文档：

注意，即使在多行模式下，重新匹配（）只匹配字符串的开头，而不是每行的开头。

你可以用关于芬德尔或者重新查找要查找所有匹配项：

>>> for match in p.finditer(str):
     ... print match.groups()

 ('Jack', 'A.', 'Smith')
 ('Mary', 'B.', 'Miller')

要使用组名而不是索引，可以指定已使用的组名：

>>> for match in p.finditer(str):
    ... print match.group('Given')

  Jack
  Mary

网友

2楼 · 编辑于 2024-04-18 00:00:05

从re.match()复制

Note that even in MULTILINE mode, re.match() will only match
at the beginning of the string and not at the beginning of each line

这就是为什么你只得到第一场比赛。如果需要所有匹配项，请使用re.findall()

将整个正则表达式包装在()中，下面是一个示例：

p = re.compile(r'^((?P<Given>\w+) (?P<Middle>\w\.) (?P<Family>\w+))$', re.MULTILINE)
str = "Jack A. Smith\nMary B. Miller"
print re.findall(p, str)

输出：

[('Jack A. Smith', 'Jack', 'A.', 'Smith'), ('Mary B. Miller', 'Mary', 'B.', 'Miller')]

更新：：

关于你的问题2：用re.finditer()来回答这个问题。举个例子：

p = re.compile(r'^(?P<FullName>(?P<Given>\w+) (?P<Middle>\w\.) (?P<Family>\w+))$', re.MULTILINE)
str = "Jack A. Smith\nMary B. Miller"
matches = re.finditer(p, str)
for match in matches:
    info = match.groupdict()  ## pulling out the match as dictionary
    print info
    print info['Family']

问题3：

使用re.sub()就足够了。你知道吗

print re.sub("Mary B\. Miller", "Jane M. Goldstein", str)
## notice I have escaped the . with \.
## in regex . means any non white space characters.

网友

3楼 · 编辑于 2024-04-18 00:00:05

I am using the Given, Middle and Family as the tag names for each match, how do I access the data using such tags and not just m.group(i)

您可以使用m.group('Given'), m.group('Middle'), m.group('Family')

Let us say I want to do match and replace? I.e., I want to match Mary B. Miller, and replace by Jane M. Goldstein, such that the replaced string will now be: str = "Jack A. Smith\nJane M. Goldstein". How'd I go to do that?

据我所知，re.sub()可以用于搜索和替换。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章