BeautifulSoup4 Python 3.4刮削问题

2024-05-13 04:06:52 发布

男 | 程序猿一只，喜欢编程写python代码。

试图从一个网站上提取数据，该网站在相同的标签名下有一些不同的信息。例如：

<td class=pizza>cheese </td>
<td class=pizza> deluxe</td>
<td class=pizza> pepperoni</td>
<td class=pizza> raccoon</td>
<td class=pizza> mushshroom</td>
<td class=pizza> anchovy</td>

我如何写一行网页刮（使用beautifulsoup4）选择一个不会是奶酪(说我想要豪华的）。是不是有点像：

pizza = soup.find("td", {"class": "pizza"})
print (pizza) #This will give me cheese as a result, and I want deluxe.

Tags：数据信息网页网站标签 class td raccoon

1条回答

网友

1楼 · 发布于 2024-05-13 04:06:52

如果您不需要cheese，可以使用a function将其过滤掉：

soup.find("td", class_="pizza", text=lambda text: text and text.strip() != "cheese")

或者，如果希望第二个td元素具有class="pizza"：

soup.find_all("td", class_="pizza")[1]

BeautifulSoup4 Python 3.4刮削问题

相关问题更多 >

编程相关推荐

热门问题

热门文章

BeautifulSoup4 Python 3.4刮削问题

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >