JSON.Dump文件无法捕捉到整个流

import scrapy import json import js2xml from pprint import pprint class StlocSpider(scrapy.Spider): name = "stloc" allowed_domains = ["bestbuy.com"] start_urls = ( 'http://www.bestbuy.com/site/store-locator/11356', 'http://www.bestbuy.com/site/store-locator/46617', 'http://www.bestbuy.com/site/store-locator/77521' ) def parse(self, response): js = response.xpath('//script[contains(.,"window.appData")]/text()').extract_first() jstree = js2xml.parse(js) # print(js2xml.pretty_print(jstree)) app_data_node = jstree.xpath('//assign[left//identifier[@name="appData"]]/right/*')[0] app_data = js2xml.make_dict(app_data_node) print(app_data['stores']) for store in app_data['stores']: yield store with open('stores.json', 'w') as f: json.dump(app_data['stores'], f, indent=4)

1条回答

网友

1楼 · 发布于 2024-05-15 00:02:02

每次都要打开文件进行写入，但要追加。尝试将最后一部分改为：

with open('stores.json', 'a') as f:
    json.dump(app_data['stores'], f, indent=4)

其中'a'打开要追加的文件。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章