BeautifulSoup: findAll recursive not working

import urllib.request
from bs4 import BeautifulSoup
import datetime
import html
import sys

articleUrl = "https://www.wired.com/2016/07/greatest-feats-inventions-100-years-boeing/"
soupArticle = BeautifulSoup(urllib.request.urlopen(articleUrl), "html.parser")
articleBody = soupArticle.find("article", {"itemprop": "articleBody"})
articleContentTags = articleBody.findAll(["h1", "h2", "h3", "p"], recursive="False")
for tag in articleContentTags:
    print(tag.name)
    print(tag.parent.encode("utf-8"))

1 answer

User

#1 · Posted 2024-06-16 11:19:29

The string literal "False" is not the same as the boolean False. You need to actually pass recursive=False:

articleBody.find_all(["h1", "h2","h3", "p"], recursive=False)

Any non-empty string is treated as a truthy value, so the only string that would behave like False here is the empty string, i.e. recursive="".

In [17]: bool("False")
Out[17]: True

In [18]: bool("foo")
Out[18]: True

In [19]: bool("")
Out[19]: False

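But stick to the actual boolean False. Also note that when nothing matches with recursive=False, you get an empty list/ResultSet back rather than None, because you are calling find_all, not find.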
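A minimal sketch of both points, assuming a small hand-written HTML snippet (hypothetical, standing in for the Wired article body) so that it runs without network access. The variable names are illustrative only.

```python
from bs4 import BeautifulSoup

# Hypothetical markup: one heading and one paragraph are direct children
# of <article>, and a second paragraph is nested inside a <div>.
html_doc = """
<article itemprop="articleBody">
  <h1>Title</h1>
  <p>Direct child paragraph</p>
  <div><p>Nested paragraph</p></div>
</article>
"""

soup = BeautifulSoup(html_doc, "html.parser")
article_body = soup.find("article", {"itemprop": "articleBody"})
names = ["h1", "h2", "h3", "p"]

# Boolean False: only direct children of <article> are searched.
direct = article_body.find_all(names, recursive=False)
direct_names = [tag.name for tag in direct]

# The string "False" is truthy, so the search still descends into <div>.
string_flag = article_body.find_all(names, recursive="False")
string_flag_names = [tag.name for tag in string_flag]

# No match: find_all gives an empty result set, find gives None.
no_match_all = article_body.find_all("table", recursive=False)
no_match_one = article_body.find("table", recursive=False)

print(direct_names)        # direct children only
print(string_flag_names)   # includes the nested <p>
print(len(no_match_all), no_match_one)
```

With recursive=False the nested paragraph is skipped, while recursive="False" behaves like the default recursive search.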
