Python“requests”只返回“Response[403]”

2024-06-16 10:58:02 发布

您现在位置:Python中文网/ 问答频道 /正文

我试图使用Python的requests包自动处理网页上的一些数据,但我只得到“Response[403]”作为返回值。我做错什么了?以下是最小(非)工作示例:

import requests

url = 'http://tools.iedb.org/mhcii/'
seq = "GGPKLRGNVTSNIKFPSDNKGKIIRGSNDKLNKNSEDVLEQSEKSLVSENVPSGLDIDDI"

form_data = {'sequence_text': seq,
             'submit': 'submit'}

response = requests.post(url, data=form_data)

print(response)

我也尝试过给它一个额外的头,其中包含herehere中建议的浏览器信息,但响应仍然保持不变。思想?你知道吗

更新:

我已经合并了下面的建议和403错误消失了!现在,webform返回一个有效的htm页面,其中包含消息“You must select an allege”。我不太确定我做错了什么,因为我在请求中设置了“等位基因列表”:

import requests

url = 'http://tools.iedb.org/mhcii/'
seq = "GGPKLRGNVTSNIKFPSDNKGKIIRGSNDKLNKNSEDVLEQSEKSLVSENVPSGLDIDDI"

client = requests.session()

# retrieve the CSRF token
client.get(url)  # sets cookie
if 'csrftoken' in client.cookies:
    csrftoken = client.cookies['csrftoken'] # Django 1.6 and up
else:
    csrftoken = client.cookies['csrf'] # older versions

form_data = {"sequence_text": seq,
             "method": "1",
             "sequence_format": "fasta",
             "locus_list": "Human, HLA-DR",
             "allele_list": "DRB1*01:01", # or HLA-DRB1*01:01
             "sort_output": "MHC_IC50",
             "output_format": "ascii",
             "submit": "submit",
             "csrfmiddlewaretoken": csrftoken}

header_data = {"Referer": url,
               "User-Agent": "Mozilla/5.0 (X11; Fedora; Linux x86_64; rv:52.0) Gecko/20100101 Firefox/52.0"}

r = client.post(url, data=form_data, headers=header_data)

print(r.text)

Tags: textimportformclienthttpurldatarequests