在Python中解析caption标记中的文本

1条回答

网友

1楼 · 发布于 2024-04-25 00:43:04

主要问题是

您正在访问match.groups()[0]，而您应该访问match.group(1)，因为您用模式中的一对无转义括号捕获了所需的部分，并且它们是唯一一对捕获括号，因此ID=1。你知道吗
您将贪婪量词与.*一起使用，同时需要.*?尽可能少地匹配除换行符以外的字符

注意：如果文本跨越多行，还应该将re.DOTALL或re.S传递给^{}，以便.可以匹配换行符。你知道吗

import re
regex = r"\[caption.*?](.*?)\[/caption]"
test_str = "[caption id=\"attachment_1749417\" align=\"aligncenter\" width=\"426\"][![femur head cross section](http://www.wired.com/wp-content/uploads/2015/03/femur-head-cross-section.png)](http://www.bartleby.com/107/illus247.html) A cross-section of the top of the thigh bone. ![](http://www.wired.com/wp-content/themes/Phoenix/assets/images/gallery-cam@2x.png) [Gray's Anatomy](http://www.bartleby.com/107/illus247.html) / Public Domain[/caption]"
match = re.search(regex, test_str)
if match:
    print(match.group(1))

打印：

[![femur head cross section](http://www.wired.com/wp-content/uploads/2015/03/femur-head-cross-section.png)](http://www.bartleby.com/107/illus247.html) A cross-section of the top of the thigh bone. ![](http://www.wired.com/wp-content/themes/Phoenix/assets/images/gallery-cam@2x.png) [Gray's Anatomy](http://www.bartleby.com/107/illus247.html) / Public Domain

相关问题更多 >

编程相关推荐

热门问题

热门文章

在Python中解析caption标记中的文本

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >