无法使用BeautifulSoup检索页面内容

from bs4 import BeautifulSoup import requests root = 'https://www.quora.com/topic/Graduate-Record-Examination-GRE-1' r = requests.get(root) soup = BeautifulSoup(r.text,'html.parser') #**The following worked yielded some results :** #1 a = soup.find_all('div',{'class':'feed'}) print(a) #2 b = soup.find_all('div',{'class':'ContentWrapper'}) print(b) #3 c = soup.find_all('div',{'class':'ContentWrapper'}) print(c) #4 d = soup.find_all('div',{'class':'feed'}) print(d) #5 e = soup.find_all('div',{'class':'TopicFeed'}) print(e)

1条回答

网友

1楼 · 发布于 2024-04-20 06:47:08

站点可以配置为基于用户代理发送不同的页面。我遇到了和你一样的问题。它返回了一个空列表。在头文件中添加一个通用的用户代理为我解决了这个问题。你知道吗

from bs4 import BeautifulSoup
import requests
root = 'https://www.quora.com/topic/Graduate-Record-Examination-GRE-1'
headers = {'User-Agent' : 'Mozilla/5.0 (Macintosh; Intel Mac OS X x.y; rv:42.0) Gecko/20100101 Firefox/42.' }
r = requests.get(root,headers=headers)
soup = BeautifulSoup(r.text,'html.parser')
f = soup.findAll('div',{'class':'paged_list_wrapper'})
print(f)

相关问题更多 >

编程相关推荐

热门问题

热门文章