Python regular expression slicing


2 answers

User

#1 · Edited 2024-04-23 15:25:41

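When extracting information from HTML, stitching together a few regular expressions is not recommended. The right approach is to use a proper HTML parsing module. Python has several good ones for this purpose, and I particularly recommend BeautifulSoup.

Don't be put off by the name: it is a serious module that many people use successfully. The documentation page has plenty of examples to help you get started with your particular needs.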
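To illustrate why a parser is more robust than a regex here, this is a minimal sketch that needs no third-party install. It swaps in the standard library's `html.parser` rather than BeautifulSoup, and the `SpanTextCollector` class, the sample `htmldoc` string, and the `type` class name are assumptions made up for the example, not something from the question.

```python
from html.parser import HTMLParser


class SpanTextCollector(HTMLParser):
    """Collect the text inside every <span> whose class list contains "type"."""

    def __init__(self):
        super().__init__()
        self._depth = 0      # > 0 while inside a matching span
        self.results = []    # text of each matching span, in document order

    def handle_starttag(self, tag, attrs):
        if tag != "span":
            return
        # The class attribute may be missing or hold several class names
        classes = (dict(attrs).get("class") or "").split()
        if self._depth or "type" in classes:
            if self._depth == 0:
                self.results.append("")  # start a new result entry
            self._depth += 1             # also tracks spans nested inside a match

    def handle_endtag(self, tag):
        if tag == "span" and self._depth:
            self._depth -= 1

    def handle_data(self, data):
        if self._depth:
            self.results[-1] += data


# Hypothetical sample document
htmldoc = (
    '<p><span class="type">int</span> x; '
    '<span class="name">y</span> '
    '<span class="type big">str</span></p>'
)

collector = SpanTextCollector()
collector.feed(htmldoc)
collector.close()
print(collector.results)  # ['int', 'str']
```

The parser matches on the parsed class list, so extra classes, attribute order, and quoting style do not matter, which is exactly where hand-written regexes tend to break.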

User

#2 · Edited 2024-04-23 15:25:41

Why not try BeautifulSoup?

Sample code:

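from bs4 import BeautifulSoup  # BeautifulSoup 4; "from BeautifulSoup import ..." is the old version 3

# htmldoc is assumed to hold the HTML source as a string
soup = BeautifulSoup(htmldoc, "html.parser")
# "class" is a reserved word in Python, so bs4 takes the keyword class_ instead
allSpans = soup.find_all('span', class_="type")
for element in allSpans:
    ...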