使用bs4/python3提取href？

#!/usr/bin/python3 import bs4 as bs import urllib.request import time, datetime, os, requests, lxml.html import re from fake_useragent import UserAgent url = "https://www.cvedetails.com/vulnerability-list.php" ua = UserAgent() header = {'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/59.0.3071.115 Safari/537.36'} snkr = requests.get(url,headers=header) soup = bs.BeautifulSoup(snkr.content,'lxml') for item in soup.find_all('tr', class_="srrowns"): print(item.td.next_sibling.next_sibling.a)

1条回答

网友

1楼 · 发布于 2024-05-15 23:23:26

BeautifulSoup通常有太多用于过滤和获取内容的历史变体，其中一些变体比其他变体更烦人。我忽略了其中的大部分，因为这让人困惑。你知道吗

对于属性，我更喜欢get（），所以这里是item.td.next_sibling.next_sibling.a.get('href')，因为如果没有这样的属性，它将返回None，而不是给出异常。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章