How do I check whether urllib2 followed a redirect?
I wrote this function:
import urllib2

def download_mp3(url, name):
    opener1 = urllib2.build_opener()
    page1 = opener1.open(url)
    mp3 = page1.read()
    filename = name + '.mp3'
    fout = open(filename, 'wb')
    fout.write(mp3)
    fout.close()
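The function takes a URL and a name, both strings. It downloads an mp3 file from the URL and saves it under the name given in the name variable.
The URL has the form http://site/download.php?id=xxxx, where xxxx is the ID of an mp3.
If the ID does not exist, the site redirects me to another page.
So my question is: how can I check whether the ID exists? I tried checking whether the URL exists with a function like this: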
import httplib
from urlparse import urlparse

def checkUrl(url):
    p = urlparse(url)
    conn = httplib.HTTPConnection(p.netloc)
    conn.request('HEAD', p.path)
    resp = conn.getresponse()
    return resp.status < 400
But it doesn't seem to work...
Thanks!
2 Answers
2
Here is how I would approach it:
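import urllib2

req = urllib2.Request(url)
try:
    response = urllib2.urlopen(req)
except urllib2.HTTPError as e:
    # The server answered with an error status; handle it here or re-raise.
    raise
else:
    # geturl() is the URL actually retrieved, after any redirects.
    if response.geturl() != url:
        print 'We have redirected!'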
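The same comparison could be folded into the question's download_mp3 so that nothing is saved when the site redirects. This is only a minimal sketch: the `opener` parameter, the True/False return value, and the Python 3 import fallback are assumptions added for illustration, and it assumes the site never redirects a request for a valid ID.

```python
try:
    import urllib2                      # Python 2, as in the question
except ImportError:
    import urllib.request as urllib2    # Python 3 fallback (assumption)


def download_mp3(url, name, opener=None):
    """Save the file at url as name + '.mp3'.

    Returns False without writing anything if the final URL differs from
    the requested one, i.e. a redirect was followed.
    """
    # `opener` is a hypothetical hook so a custom opener can be passed in.
    opener = opener or urllib2.build_opener()
    page = opener.open(url)
    try:
        if page.geturl() != url:
            return False
        data = page.read()
    finally:
        page.close()
    with open(name + '.mp3', 'wb') as fout:
        fout.write(data)
    return True
```

A call such as download_mp3('http://site/download.php?id=1234', 'song') would then return False for an unknown ID instead of saving the error page as song.mp3.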
5
Something like this; check the response code:
import urllib
import urllib2

class NoRedirectHandler(urllib2.HTTPRedirectHandler):
    def http_error_302(self, req, fp, code, msg, headers):
        # Return the 3xx response as-is instead of following the redirect.
        infourl = urllib.addinfourl(fp, headers, req.get_full_url())
        infourl.status = code
        infourl.code = code
        return infourl
    # Treat the other redirect statuses the same way.
    http_error_300 = http_error_302
    http_error_301 = http_error_302
    http_error_303 = http_error_302
    http_error_307 = http_error_302

opener = urllib2.build_opener(NoRedirectHandler())
urllib2.install_opener(opener)
response = urllib2.urlopen('http://google.com')
if response.code in (300, 301, 302, 303, 307):
    print('redirect')
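The status-code idea also carries over to the HEAD-based checkUrl from the question. httplib does not follow redirects, so a 302 is returned as-is, and `resp.status < 400` counts it as success; in addition, `p.path` drops the `?id=xxxx` query string, so the ID is never sent. A hedged sketch of one possible fix follows. The names `request_target` and `check_url` and the Python 3 import fallback are assumptions for illustration, and it assumes the site answers HEAD the same way it answers GET.

```python
try:
    import httplib                          # Python 2, as in the question
    from urlparse import urlparse
except ImportError:
    import http.client as httplib           # Python 3 fallback (assumption)
    from urllib.parse import urlparse


def request_target(url):
    """Return the path plus query string to send on the request line."""
    p = urlparse(url)
    target = p.path or '/'
    if p.query:
        target += '?' + p.query
    return target


def check_url(url):
    """Return True only if a HEAD request for url gets a 2xx status."""
    p = urlparse(url)
    conn = httplib.HTTPConnection(p.netloc)
    try:
        conn.request('HEAD', request_target(url))
        status = conn.getresponse().status
    finally:
        conn.close()
    # 3xx means the site redirected, which the question says happens
    # for a missing ID, so only 2xx counts as "exists".
    return 200 <= status < 300
```

With this, request_target('http://site/download.php?id=1234') yields '/download.php?id=1234', so the ID actually reaches the server.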