我不能用for循环来列出elemen

import scrapy from centech.items import CentechItem class CentechSpiderSpider(scrapy.Spider): name = 'centech_spider' start_urls = ['https://centech.co/nos-entreprises/'] def parse(self, response): items = CentechItem() all_companies = response.xpath("//div[@class = 'fl-post-carousel- post']")[1] # "//div[@class = 'fl-post-carousel-post']")[1] Nom = all_companies.xpath("//h2[contains(@class, 'fl-post-carousel- title')]/text()").extract() Description = all_companies.xpath("//div[contains(@class, 'description')]/p/text()").extract() # Nom = all_companies.response.css("h2.fl-post-carousel- title::text").extract() # Description = all_companies.xpath("p::text").extract() yield {'Nom' : Nom , 'Description' : Description , }

1条回答

网友

1楼 · 发布于 2024-05-16 13:20:20

我不太确定你想要什么样的结果。我猜了一下，修改了你的脚本，得到了以下结果。您需要深入一层才能获取完整的描述，因为有些描述已损坏：

import scrapy

class CentechSpiderSpider(scrapy.Spider):
    name = 'centech_spider'
    start_urls = ['https://centech.co/nos-entreprises/']

    def parse(self, response):
        for item in response.css("a.fl-post-carousel-link"):
            nom = item.css(".description > h2.fl-post-carousel-title::text").get()
            description = item.css(".description > p::text").get()
            yield {'nom':nom,'description':description}

相关问题更多 >

编程相关推荐

热门问题

热门文章