找一个由三个大写字母环绕的小写字母

import string, re if __name__ == "__main__": #open the file eqfile = open("string.txt") gibberish = eqfile.read() eqfile.close() r = re.compile("[A-Z]{3}[a-z][A-Z]{3}") print r.findall(gibberish)

3条回答

网友

1楼 · 编辑于 2024-05-18 23:28:14

r = re.compile("(?<=[A-Z]{3})[a-z](?=[A-Z]{3})")

(?<=...)表示正向后看，(?=...)表示正向展望。在

module re

(?=...)
Matches if ... matches next, but doesn’t consume any of the string. This is called a lookahead assertion. For example, Isaac (?=Asimov) will match 'Isaac ' only if it’s followed by 'Asimov'.
(?<=...)
Matches if the current position in the string is preceded by a match for ... that ends at the current position.

网友

2楼 · 编辑于 2024-05-18 23:28:14

您需要用括号捕获您感兴趣的字符串部分，然后用re.MatchObject#group访问它：

r = re.compile("[A-Z]{3}([a-z])[A-Z]{3}")                                                                                                                                      
m = r.match(gibberish)
if m:
   print "Match! Middle letter was " + m.group(1)           
else:
   print "No match."

网友

3楼 · 编辑于 2024-05-18 23:28:14

你已经很接近了！阅读MatchObjects的.group*方法。例如，如果您的脚本以

r = re.compile("[A-Z]{3}([a-z])[A-Z]{3}")
print r.match(gibberish).group(1)

然后在第一组中捕捉所需的角色。在

要解决匹配重复字母的新限制，可以使用反向引用：

^{pr2}$

读起来像：

匹配字母a-Z并记住它。在
匹配找到的第一个字母的两个匹配项。在
匹配您的小写字母并将其存储在名为middle的组中。在
匹配找到的第一个字母的另外三个连续实例。在
如果找到匹配项，则打印middle组的值。在

`(?=...)`

`(?<=...)`

相关问题更多 >

编程相关推荐

热门问题

热门文章