<p>One thing that makes <a href="http://lxml.de" rel="noreferrer"><strong>lxml</strong></a> better than BeautifulSoup is its support for proper CSS class selection (and even <a href="http://lxml.de/dev/cssselect.html" rel="noreferrer">full CSS selectors</a>, if you want to use them).</p>
<pre><code>import lxml.html
html = """<html>
<body>
<div class="bread butter"></div>
<div class="bread"></div>
</body>
</html>"""
tree = lxml.html.fromstring(html)
elements = tree.find_class("bread")
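for element in elements:
    # encoding="unicode" returns str rather than bytes on Python 3;
    # with_tail=False leaves out the newline text that follows each element
    print(lxml.html.tostring(element, encoding="unicode", with_tail=False))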
</code></pre>
<p>This gives:</p>
<pre><code><div class="bread butter"></div>
<div class="bread"></div>
</code></pre>
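<p>To illustrate what "proper" class selection means here, the sketch below adds a hypothetical third <code>div</code> with the class <code>breadcrumb</code> (not part of the example above) and compares <code>find_class</code> with a naive substring test written in XPath. It is only meant as a rough comparison: <code>find_class</code> matches whole whitespace-separated class tokens, while the substring test also picks up <code>breadcrumb</code>.</p>

```python
import lxml.html

# Same markup as above, plus a hypothetical "breadcrumb" div for contrast
html = """<html>
<body>
<div class="bread butter"></div>
<div class="bread"></div>
<div class="breadcrumb"></div>
</body>
</html>"""

tree = lxml.html.fromstring(html)

# find_class compares against whole class tokens, so "breadcrumb" is skipped
token_matches = [el.get("class") for el in tree.find_class("bread")]

# A plain substring test on the class attribute also catches "breadcrumb"
substring_matches = [
    el.get("class") for el in tree.xpath('//*[contains(@class, "bread")]')
]

print(token_matches)
print(substring_matches)
```

<p>The first list should contain only the two elements that really carry the <code>bread</code> class; the second should also include <code>breadcrumb</code>.</p>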