如何在页面上刮取网站链接？

2024-05-31 23:58:04 发布

您现在位置：Python中文网/ 问答频道 /正文

3704

网友

男 | 程序猿一只，喜欢编程写python代码。

我试过使用{“a”，{“target”：“\u blank”}）来获取这个列表中的网址，但这不起作用。你知道吗

在这种情况下，什么才是正确的刮链方法？我应该使用{“target”：“something”}还是{“href”：“something”}？你知道吗

import requests
from bs4 import BeautifulSoup

url = "http://www.gym-directory.com/listing/bulldog-gym/"
r = requests.get(url)

soup = BeautifulSoup(r.text, 'html.parser')

print soup.find("span",{"class":"wlt_shortcode_TITLE"}).text 
print soup.find("span",{"class":"wlt_shortcode_map_location"}).text 
print soup.find("span",{"class":"wlt_shortcode_phoneNum"}).text
print soup.find("span",{"class":"wlt_shortcode_EMAIL"}).text
print soup.find("a",{"target":"_blank"}).text

Tags： text import url target find requests something class

1条回答

网友

1楼 · 发布于 2024-05-31 23:58:04

您需要href：

soup.find("a", {"target":"_blank"})["href"]

如何在页面上刮取网站链接？

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何在页面上刮取网站链接？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >