Python请求库中的代理错误
我正在尝试通过代理服务器在Python中访问网页。我使用的是requests库,但遇到了一个问题,因为我使用的代理需要密码,所以我在认证代理时出现了困难。
proxyDict = {
'http' : 'username:mypassword@77.75.105.165',
'https' : 'username:mypassword@77.75.105.165'
}
r = requests.get("http://www.google.com", proxies=proxyDict)
我遇到了以下错误:
Traceback (most recent call last):
File "<pyshell#13>", line 1, in <module>
r = requests.get("http://www.google.com", proxies=proxyDict)
File "C:\Python27\lib\site-packages\requests\api.py", line 78, in get
:param url: URL for the new :class:`Request` object.
File "C:\Python27\lib\site-packages\requests\api.py", line 65, in request
"""Sends a POST request. Returns :class:`Response` object.
File "C:\Python27\lib\site-packages\requests\sessions.py", line 187, in request
def head(self, url, **kwargs):
File "C:\Python27\lib\site-packages\requests\models.py", line 407, in send
"""
File "C:\Python27\lib\site-packages\requests\packages\urllib3\poolmanager.py", line 127, in proxy_from_url
File "C:\Python27\lib\site-packages\requests\packages\urllib3\connectionpool.py", line 521, in connection_from_url
File "C:\Python27\lib\site-packages\requests\packages\urllib3\connectionpool.py", line 497, in get_host
ValueError: invalid literal for int() with base 10: 'h6f2v6jh5dsxa@77.75.105.165'
我该如何解决这个问题呢?
提前感谢你的帮助。
3 个回答
1
你可以使用 urllib
这个库来实现这个功能。
from urllib import request
request.urlopen("your URL", proxies=request.getproxies())
3
我在Windows上也遇到了类似的问题,发现让requests
正常工作的唯一方法是,在启动Python之前先把代理设置为环境变量。对你来说,可能需要这样做:
set HTTP_PROXY=http://77.75.105.165
set HTTPS_PROXY=https://77.75.105.165
你还可能需要检查一下是否需要特定的端口,如果需要的话,就在网址后面设置它。例如,如果端口是8443,那么可以这样做:
set HTTP_PROXY=http://77.75.105.165:8443
set HTTPS_PROXY=https://77.75.105.165:8443
17
你应该把proxyDict
里面的用户名和密码去掉,改用auth
这个参数。
import requests
from requests.auth import HTTPProxyAuth
proxyDict = {
'http' : '77.75.105.165',
'https' : '77.75.105.165'
}
auth = HTTPProxyAuth('username', 'mypassword')
r = requests.get("http://www.google.com", proxies=proxyDict, auth=auth)