Scrapy shell does not open the page

2 answers

User

#1 · edited 2024-05-23 15:09:15

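Many thanks to mertyildiran for the help.

Scrapy shell does not work for me. Sometimes it fetches the page, but most of the time it does not. I don't know why.

In any case, the code I ended up with works every time.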

import scrapy

class QuotesSpider(scrapy.Spider):
    name = "allegro"
    start_urls = ['http://allegro.pl/sportowe-uzywane-251188?a_enum%5B127779%5D%5B15%5D=15&a_text_i%5B1%5D%5B0%5D=2004&a_text_i%5B1%5D%5B1%5D=2009&a_text_i%5B5%5D%5B0%5D=950&id=251188&offerTypeBuyNow=1&order=p&string=gsxr&bmatch=base-relevance-aut-1-1-0913']

    def parse(self, response):
        # Each listing on the results page is an <article class="offer">
        for lista in response.css("article.offer"):
            yield {
                # extract() returns a list of every matching href
                'link': lista.css('a.offer-title::attr(href)').extract(),
            }

User

#2 · edited 2024-05-23 15:09:15

This works for me. I suggest you start from the most basic tutorial:

import scrapy

class BlogSpider(scrapy.Spider):
    name = 'blogspider'
    start_urls = ['http://allegro.pl/sportowe-uzywane-251188?a_enum%5B127779%5D%5B15%5D=15&a_text_i%5B1%5D%5B0%5D=2004&a_text_i%5B1%5D%5B1%5D=2009&a_text_i%5B5%5D%5B0%5D=950&id=251188&offerTypeBuyNow=1&order=p&string=gsxr&bmatch=base-relevance-aut-1-1-0913']

    def parse(self, response):
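        # Print the raw page body between two separator lines
        print("--------------------------------")
        print(response.body)
        print("--------------------------------")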

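I can see the page body. view(response) fails with an error: the function is not defined.

Save this code as myspider.py and run it with scrapy runspider myspider.py. You will see one large string printed to your terminal: the page body, between the two separator lines.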

For Scrapy shell:

Start in shell mode: scrapy shell

Then just run:

view(response)

It will open the scraped page in your default browser. Your URL worked for me.

For the title tag, it shows:

^{3}$

The crawled page is saved under the /tmp directory, with a name like /tmp/tmpn8wziQ.html
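The save-then-open behaviour described above can be approximated with the standard library alone, which may help if view is unavailable in your session. This is a rough sketch, not Scrapy's own code: save_body_to_temp_html, open_in_default_browser, and the sample body are hypothetical names and data chosen for illustration.

```python
import os
import pathlib
import tempfile
import webbrowser


def save_body_to_temp_html(body: bytes) -> str:
    """Write raw page bytes to a new temporary .html file and return its path."""
    fd, path = tempfile.mkstemp(suffix=".html")
    with os.fdopen(fd, "wb") as f:
        f.write(body)
    return path


def open_in_default_browser(path: str) -> bool:
    """Ask the default browser to open the saved file via a file:// URI."""
    return webbrowser.open(pathlib.Path(path).as_uri())


# Hypothetical body; inside a spider this would be response.body.
sample_body = b"<html><head><title>demo</title></head><body>hello</body></html>"
saved_path = save_body_to_temp_html(sample_body)
```

Calling open_in_default_browser(saved_path) afterwards should bring the file up in the browser. The file is not deleted automatically, so it can be inspected later.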
