如何在python循环上使用scrapy节点

2024-04-25 21:33:40 发布

男 | 程序猿一只，喜欢编程写python代码。

嗨，我第一次尝试xml提要，下面是我的代码

class TestxmlItemSpider(XMLFeedSpider):
    name = "TestxmlItem"
    allowed_domains = {"http://www.nasinteractive.com"}


    start_urls = [
        "http://www.nasinteractive.com/jobexport/advance/hcantexasexport.xml"
    ]
    iterator = 'iternodes'
    itertag = 'job'


    def parse_node(self, response, node):
        title = node.select('title/text()').extract()
        job_code = node.select('job-code/text()').extract()
        detail_url = node.select('detail-url/text()').extract()
        category = node.select('job-category/text()').extract()

        print title,";;;;;;;;;;;;;;;;;;;;;"
        print job_code,";;;;;;;;;;;;;;;;;;;;;"

        item = TestxmlItem()
        item['title'] = node.select('title/text()').extract()
        .......  
        return item

结果：

^{pr2}$

总共有200多个项目，所以我需要循环并将节点文本分配给item 但在这里，当我们打印时，所有的结果都会同时显示出来，实际上，我们如何使用xmlfeedspider在抓取xml文件的节点上循环呢

Tags： text com node http title www job code

1条回答

网友

1楼 · 发布于 2024-04-25 21:33:40

巴勃罗·霍夫曼：

You don't have a "title" field declared in your item (TestxmlItem).

您需要添加：

title = Field()

如何在python循环上使用scrapy节点

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何在python循环上使用scrapy节点

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >