I can't find a specific title on the web page

import scrapy
from ..items import AmazonsItem

class AmazonSpiderSpider(scrapy.Spider):
    name = 'amazon_spider'
    start_urls = ['https://www.amazon.in/s?k=agatha+christie+books&crid=3MWRDVZPSKVG0&sprefix=agatha%2Caps%2C269&ref=nb_sb_ss_i_1_6']

    def parse(self, response):
        items = AmazonsItem()
        product_name = response.css('.s-access-title').extract()

1 answer

User

#1 · Posted 2024-04-18 17:45:42

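Try this. The title is stored in the element's data-attribute attribute, so select that attribute rather than the whole element: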

import scrapy
from ..items import AmazonsItem

class AmazonSpiderSpider(scrapy.Spider):
    name = 'amazon_spider'
    start_urls = ['https://www.amazon.in/s?k=agatha+christie+books&crid=3MWRDVZPSKVG0&sprefix=agatha%2Caps%2C269&ref=nb_sb_ss_i_1_6']

    def parse(self, response):
        items = AmazonsItem()
        # The title is held in the "data-attribute" attribute of each result heading.
        products_name = response.css('.s-access-title::attr("data-attribute")').extract()
        for product_name in products_name:
            print(product_name)
        # Follow the "next page" link, if there is one.
        next_page = response.css('li.a-last a::attr(href)').get()
        if next_page is not None:
            next_page = response.urljoin(next_page)
            yield scrapy.Request(next_page, callback=self.parse)

Output:

'Murder on the Orient Express (Poirot)'
'And Then There Were None'
.
.
