如何获取Urllib2.Request的请求头?
有没有办法获取使用Urllib2创建的请求的头信息,或者确认通过urllib2.urlopen发送的HTTP头信息?
1 个回答
21
查看请求和响应的头信息很简单,只需要开启调试输出:
opener = urllib2.build_opener(urllib2.HTTPHandler(debuglevel=1))
这样你就可以看到发送和接收到的具体头信息:
>>> opener.open('http://python.org')
send: 'GET / HTTP/1.1\r\nAccept-Encoding: identity\r\nHost: python.org\r\nConnection: close\r\nUser-Agent: Python-urllib/2.7\r\n\r\n'
reply: 'HTTP/1.1 200 OK\r\n'
header: Date: Tue, 14 Jun 2011 08:23:35 GMT
header: Server: Apache/2.2.16 (Debian)
header: Last-Modified: Mon, 13 Jun 2011 19:41:35 GMT
header: ETag: "105800d-486d-4a59d1b6699c0"
header: Accept-Ranges: bytes
header: Content-Length: 18541
header: Connection: close
header: Content-Type: text/html
header: X-Pad: avoid browser bug
<addinfourl at 140175550177224 whose fp = <socket._fileobject object at 0x7f7d29c3d5d0>>
在发起请求之前,你还可以使用 urllib2.Request
对象来设置头信息(这可以覆盖默认的头信息,不过在之前的头信息字典中是不会显示的):
>>> req = urllib2.Request(url='http://python.org')
>>> req.add_header('User-Agent', 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10.6; rv:5.0)')
>>> req.headers
{'User-agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10.6; rv:5.0)'}