Python美化组和请求

2024-04-19 11:05:56 发布

您现在位置:Python中文网/ 问答频道 /正文

每当我尝试运行此代码时:

def CheckStock(url,model):

    headers = {'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/39.0.2171.95 Safari/537.36'}
    RawHTML = requests.get(url, headers=headers)
    Page = bs4.BeautifulSoup(RawHTML.text, "lxml")

我不断得到:

requests.exceptions.ReadTimeout: HTTPSConnectionPool(host='www.adidas.com', port=443): Read timed out. (read timeout=None)

我使用的url是:

'https://www.adidas.com/us/stan-smith-shoes/FZ3815.html?forceSelSize=FZ3815_630'

模型是:'FZ3815'


Tags: 代码comurlmozillamodeldefwwwrequests
1条回答
网友
1楼 · 发布于 2024-04-19 11:05:56

要获得正确的页面,请指定不同的User-Agent

例如:

import requests
from bs4 import BeautifulSoup

url = 'https://www.adidas.com/us/stan-smith-shoes/FZ3815.html?forceSelSize=FZ3815_630'
headers = {'User-Agent': 'Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:78.0) Gecko/20100101 Firefox/78.0'}

RawHTML = requests.get(url, headers=headers)
Page = BeautifulSoup(RawHTML.text, "lxml")
print(Page)

印刷品:

<!DOCTYPE html>
<html class="theme-adidas" data-reactroot="" lang="en" prefix="og: http://ogp.me/ns# fb: http://ogp.me/ns/fb#"><head><title data-rh="true" id="meta-title">Stan Smith Tropical Print Sneakers | adidas US</title><meta charset="utf-8" data-rh="true" id="meta-charset"/><meta content="IE=edge,chrome=1" data-rh="true" http-equiv="X-UA-Compatible" id="meta-http-ua-compatible"/><meta content="text/html;charset=utf-8" data-rh="true" http-equiv="Content-Type" id="meta-http-content-type"/><meta content="

...and so on.

相关问题 更多 >