Python newspider package - PyPI

A simple framework for scraping data by category.

Detailed description of the newspider Python project

## example.py

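# -*- coding: utf-8 -*-
# Reconstructed from a machine-translated listing. The names start_page,
# next_page, Newspider and the 'protect_interval' key are back-translations
# and may not match the package's actual API exactly.
from pyquery import PyQuery as pq

from newspider.interfaces import *
from newspider.spider import Newspider


class DemoFetcher(IntFetcher):

    def __init__(self):
        # Pagination links collected so far. Stored under a separate name so
        # the attribute does not shadow the next_page() method.
        self._next_pages = []

    def fetch_detail_url(self, html):
        d = pq(html)
        for link in d('.page-navigator a'):
            self._next_pages.append(d(link).attr('href'))
        # The source returns a list that is never defined; an empty list of
        # detail URLs is assumed here.
        return []

    def start_page(self):
        return ['http://www.typechodev.com/',
                'http://www.typechodev.com/index.php/category/questions/']

    def next_page(self):
        return self._next_pages


class DemoParser(IntParser):

    def parse(self, tag, html, extras):
        print("Received content from URL %s for category %s, tag %s"
              % (extras.get('_url'), extras.get('category'), tag))


if __name__ == '__main__':
    sp = Newspider()
    sp.config('protect_interval', 0)
    sp.add_parser(DemoParser())
    sp.add_fetcher(DemoFetcher())
    sp.run()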
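
The pagination step in the example (collecting the links found under the page navigator) can be tried without installing newspider or pyquery and without touching the network. The following is a minimal sketch using only the standard library. The names `PageNavigatorLinks`, `extract_next_pages` and `SAMPLE_HTML` are hypothetical and not part of newspider, and the `page-navigator` class name is an assumption based on the selector used in the example.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag; they must not affect the nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class PageNavigatorLinks(HTMLParser):
    """Collect href values of <a> tags nested inside an element whose
    class list contains "page-navigator" (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._depth = 0  # 0 means we are outside the navigator element

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if self._depth:
            if tag == "a" and attrs.get("href"):
                self.links.append(attrs["href"])
            if tag not in VOID_TAGS:
                self._depth += 1
        elif "page-navigator" in (attrs.get("class") or "").split():
            self._depth = 1

    def handle_endtag(self, tag):
        if self._depth and tag not in VOID_TAGS:
            self._depth -= 1


def extract_next_pages(html):
    """Return the pagination links found in the given HTML string."""
    parser = PageNavigatorLinks()
    parser.feed(html)
    return parser.links


# Made-up markup for illustration only.
SAMPLE_HTML = """
<div id="main">
  <a href="/about">About</a>
  <ol class="page-navigator">
    <li class="current"><a href="/page/1/">1</a></li>
    <li><a href="/page/2/">2</a></li>
    <li class="next"><a href="/page/2/">Next</a></li>
  </ol>
</div>
"""

if __name__ == "__main__":
    print(extract_next_pages(SAMPLE_HTML))
```

Like the selector-based loop in the example, this sketch keeps duplicate links (a "Next" link usually repeats one of the numbered pages), so a real fetcher would probably want to deduplicate before returning them.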


newspider 0.9.9
