How do I find all the <li> elements inside a specific class?

from bs4 import BeautifulSoup, Comment
import re

# open original file
fo = open('file.php', 'r')
# convert to string
fo_string = fo.read()
# close original file
fo.close()

# create beautiful soup object from fo_string
bs_fo_string = BeautifulSoup(fo_string, "lxml")

# get rid of html comments
my_comments = bs_fo_string.findAll(text=lambda text:isinstance(text, Comment))
[my_comment.extract() for my_comment in my_comments]

my_li_list = bs_fo_string.find_all('ul', 'my_class')
print my_li_list

2 answers

User

Answer 1 · edited 2024-04-25 01:57:04

Something like this?

>>> html = """<ul class='my_class'>
... <li>thing one</li>
... <li>thing two</li>
... </ul>"""
>>> from bs4 import BeautifulSoup as BS
>>> soup = BS(html)
>>> for ultag in soup.find_all('ul', {'class': 'my_class'}):
...     for litag in ultag.find_all('li'):
...             print litag.text
... 
thing one
thing two

Explanation:

soup.find_all('ul', {'class': 'my_class'}) finds every ul tag that has the class my_class.

Then we find all the li tags inside each of those ul tags and print each li tag's text.
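Applied to the question's setup, the same two-step lookup could be sketched roughly as follows. This assumes Python 3 and bs4 with the built-in "html.parser"; the helper name li_texts_in_class and the sample markup are made up for illustration and are not part of the original thread.

```python
from bs4 import BeautifulSoup, Comment


def li_texts_in_class(markup, class_name):
    """Return the text of every <li> inside a <ul> that has class_name.

    Hypothetical helper for illustration only.
    """
    soup = BeautifulSoup(markup, "html.parser")

    # Drop HTML comments first, as the question's code does.
    for comment in soup.find_all(string=lambda s: isinstance(s, Comment)):
        comment.extract()

    texts = []
    # Step 1: every <ul> carrying the class. Step 2: every <li> inside it.
    for ul in soup.find_all("ul", {"class": class_name}):
        for li in ul.find_all("li"):
            texts.append(li.get_text(strip=True))
    return texts


# Made-up sample markup standing in for the contents of file.php.
sample = """
<ul class="my_class"><!-- note --><li>thing one</li><li>thing two</li></ul>
<ul class="other"><li>ignored</li></ul>
"""
print(li_texts_in_class(sample, "my_class"))
```

The question's own call, find_all('ul', 'my_class'), stops at step 1, which is why it returns the ul tags rather than the li tags.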

User

Answer 2 · edited 2024-04-25 01:57:04

Here is a way to do it with BeautifulSoup 3; I don't have version 4 on this machine.

>>> [li.string for li in bs_fo_string.find('ul', {'class': 'my_class'}).findAll('li')]
[u'thing one', u'thing two']

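The idea is to first search for the ul with the class 'my_class', then call findAll on that ul to find its li elements.

If there are additional ul elements with the same class, you may want to use findAll for the ul search as well and change the list comprehension to a nested one.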
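A minimal sketch of that nested variant, written for bs4 (find_all) rather than BeautifulSoup 3 (findAll), and assuming Python 3 with the built-in "html.parser". The sample markup is invented for illustration.

```python
from bs4 import BeautifulSoup

# Made-up markup with two <ul> elements sharing the same class.
html = """
<ul class="my_class"><li>thing one</li><li>thing two</li></ul>
<ul class="my_class"><li>thing three</li></ul>
"""
soup = BeautifulSoup(html, "html.parser")

# Outer loop: every matching <ul>; inner loop: every <li> in that <ul>.
items = [
    str(li.string)
    for ul in soup.find_all("ul", {"class": "my_class"})
    for li in ul.find_all("li")
]
print(items)
```

Note that li.string is None when an li contains more than one child node, in which case li.get_text() is the safer choice.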
