用BeautifulSoup显示p标签内的所有b标签

people = self.concattexts.filter(code='Active') for p in people: soup = BeautifulSoup(p.text_html, 'html.parser') all_people = [b.get_text(separator=' - ', strip=True) for b in soup.find_all('b')] return all_people

2条回答

网友

1楼 · 编辑于 2024-05-16 03:39:15

没有标记的文本是NavigableString：

>>> soup = BeautifulSoup('<p class="name"><b>Name of person</b> City, Country</p>',
...                      'html.parser')
>>> children = list(soup.p.children)
>>> children
[<b>Name of person</b>, u' City, Country']
>>> type(children[-1])
<class 'bs4.element.NavigableString'>
>>> isinstance(children[-1], basestring)
True

我建议获取p的子元素，并确保它们具有正确的结构（后跟一个字符串的<b>标记），然后根据需要提取信息。在

网友

2楼 · 编辑于 2024-05-16 03:39:15

from bs4 import BeautifulSoup
doc = '''
<p class="name">
<b>Name of person</b> City, Country</p>
<p class="name">
<b>Name of person</b></p>
'''
soup = BeautifulSoup(doc,'lxml')

for i in soup.find_all('p', class_='name'):
    print(i.get_text(separator=' - ', strip=True))

输出：

^{pr2}$

get_text()可以得到标签下的所有文本，不需要使用b tag，只要{}就可以了

相关问题更多 >

编程相关推荐

热门问题

热门文章

用BeautifulSoup显示p标签内的所有b标签

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >