Python xpaw包_程序模块 - PyPI

异步web抓取框架

xpaw的Python项目详细描述

https://travis-ci.org/jadbin/xpaw.svg?branch=master

https://coveralls.io/repos/jadbin/xpaw/badge.svg?branch=master

https://img.shields.io/badge/license-Apache2-blue.svg

Key Features

A web scraping framework used to crawl web pages
Data extraction tools used to extract structured data from web pages

Spider Example

以下是我们的一个爬虫类示例，其作用为爬取百度新闻的热点要闻:

fromxpawimportSpider,HttpRequest,Selector,run_spiderclassBaiduNewsSpider(Spider):defstart_requests(self):yieldHttpRequest("http://news.baidu.com/",callback=self.parse)defparse(self,response):selector=Selector(response.text)hot=selector.css("div.hotnews a").textself.log("Hot News:")foriinrange(len(hot)):self.log("%s: %s",i+1,hot[i])if__name__=='__main__':run_spider(BaiduNewsSpider)

在爬虫类中我们定义了一些方法：

start_requests: 返回爬虫初始请求。
parse: 处理请求得到的页面，这里借助 Selector 及CSS Selector语法提取到了我们所需的数据。

Documentation

http://xpaw.readthedocs.io/

欢迎加入QQ群-->： 979659372

xpaw 0.12.0

xpaw的Python项目详细描述

Key Features

Spider Example

Documentation

推荐PyPI第三方库

fddtest

adafruit-circuitpython-irremote

lin-demo

project-settings

RelayMuseum

cloudfront-edge-codes

odoo10-addon-base-location-geonames-import

potp

sunrice

django-babeljs

odoo9-addon-account-invoice-pricelist

Autils

django-offline-messages

de9im

deux-q5

导航栏

项目链接

标签

维护者

最新PyPI项目

最新Python常见问题

xpaw 0.12.0

xpaw的Python项目详细描述

Key Features

Spider Example

Documentation

推荐PyPI第三方库

fddtest

adafruit-circuitpython-irremote

lin-demo

project-settings

RelayMuseum

cloudfront-edge-codes

odoo10-addon-base-location-geonames-import

potp

sunrice

django-babeljs

odoo9-addon-account-invoice-pricelist

Autils

django-offline-messages

de9im

deux-q5

导 航 栏

项目 链接

标 签

维护者

最新PyPI项目

最新Python常见问题

导航栏

项目链接

标签