python是否可以将gzip api响应文件拆分为更少的GB

import requests import gzip import json import csv # url ='https://api.thisismyurl.com/vulnerabilities/download_data_zip' token = 'blahblahblah' # 'Content-Type': 'application/json' headers = {'X-Risk-Token': token, 'Accept': 'application/json'} response = requests.get(url,headers=headers) print(response.status_code) json_format = json.loads(response.text) print(json_format)

1条回答

网友

1楼 · 发布于 2024-04-19 23:43:09

要做与curl完全相同的事情，必须执行以下操作：

import requests
import shutil

# curl -H "X-Risk-Token: $token" "https://api.nyc3.us.thisismyurl.com/vulnerabilities/download_data_zip" -o file.gz -vv

url ='https://api.thisismyurl.com/vulnerabilities/download_data_zip'
token = 'blahblahblah'
fname = "file.gz"
# 'Content-Type': 'application/json'
headers = {'X-Risk-Token': token } # exactly same header as curl.

# memory friendly way to download big files
with requests.get(url, headers=headers, stream=True) as resp:
    print(res.status_code)
    with open(fname, 'wb') as fout:
        shutil.copyfileobj(resp.raw, fout)

你知道吗json.loads文件（）的原始代码将消耗大量RAM，如果没有足够的可用RAM，可能会导致进程崩溃。你知道吗

你到底想怎么处理这些数据？你知道吗

可视化数据可以通过以下方式完成：

import gzip
import shutil
with gzip.open(fname) as fin:
    shutil.copyfileobj(fin, sys.stdout)

由于我建议仍然没有成功，也没有得到任何信息，这就解释了为什么“配额”会导致python脚本出现问题，而不是curl问题，我建议进行更多的测试。你知道吗

1.）下载但不存储结果（是否是网络被阻止/中止？？）你知道吗

url ='https://api.thisismyurl.com/vulnerabilities/download_data_zip'
token = 'blahblahblah'
headers = {'X-Risk-Token': token } # exactly same header as curl.

# memory friendly way to download big files
with requests.get(url, headers=headers, stream=True) as resp:
    print(res.status_code)
    downloaded = 0
    try:
        for chunk in resp.iter_content(chunk_size=1024): 
            downloaded += len(chunk)
    except Exception:
        print("Downloaded %d bytes and got an exception" % downloaded)
        raise
print("Downloaded %d bytes" % downloaded)

请检查结果是否总是相同的，或者您得到的数字是否不同

相关问题更多 >

编程相关推荐

热门问题

热门文章