从python中的URL下载实时pdf文件时出错

2024-05-15 23:35:51 发布

您现在位置:Python中文网/ 问答频道 /正文

我正在尝试下载一个带有下面Python函数的pdf文件。我可以在浏览器中打开URL(重定向到另一个URL)。但是代码出现404错误。在

import requests
 def downloadFile(url, fileName):
        r = requests.get(url, allow_redirects=True, stream=True)
        with open(fileName, "wb") as pdf:             
            for chunk in r.iter_content(chunk_size=1024):
                if chunk:
                    pdf.write(chunk)


downloadFile("http://pubs.vmware.com/vsphere-55/topic/com.vmware.ICbase/PDF/vsphere-esxi-vcenter-server-552-storage-guide.pdf", "vsphere-esxi-vcenter-server-552-storage-guide.pdf")

Tags: comtrueurlserverpdfstoragefilenamerequests
1条回答
网友
1楼 · 发布于 2024-05-15 23:35:51

很少有网站会根据语言或位置进行屏蔽。以下代码与额外的头部工程

In [11]: def downloadFile(url, fileName):
         headers = {'Accept-Language': 'en-US,en;q=0.9,te;q=0.8'}
         r = requests.get(url, allow_redirects=True, stream=True, headers=headers)
        with open(fileName, "wb") as pdf:             
           for chunk in r.iter_content(chunk_size=1024):
               if chunk:
                    pdf.write(chunk)

In [12]: downloadFile("http://pubs.vmware.com/vsphere-55/topic/com.vmware.ICbase/PDF/vsphere-esxi-vcenter-server-552-storage-guide.pdf", "vsphere-esxi-vcenter-server-552-storage-guide.pdf")

相关问题 更多 >