我正在尝试用Bs4和请求解析reddit网站上的图像,但我不知道如何解析。这是我的密码:
from bs4 import BeautifulSoup
import requests
source = requests.get('https://www.reddit.com/r/programmingmemes/').text #requests object as text
soup = BeautifulSoup(source, 'lxml')
img = soup.find('div', class_='_3Oa0THmZ3f5iZXAQ0hBJ0k') # finding first post's class
div = img.find('div') #finding 'div'
src = div.find('src') # finding 'srs'
print(src)
我希望输出为:
<div> <img alt="Post image" class="_2_tDEnGMLxpM6uOa2kaDB3 ImageBox-image media-element _1XWObl-3b9tPy64oaG6fax" src="preview.redd.it/ik1g60hzoqc61.jpg? width=640&crop=smart&auto=webp&s=c5fedaba3e5627cf8fcdd008317ac39789d71abc" style="max-height:512px"/> </div>
您需要requests.get()方法中的头才能获得正确的响应
您使用了错误的类名,我认为
_2_tDEnGMLxpM6uOa2kaDB3
应该是正确的(该站点上没有类名为_3Oa0THmZ3f5iZXAQ0hBJ0k
的元素)相关问题 更多 >
编程相关推荐