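import scrapy


class MySpider(scrapy.Spider):
    name = "my_spider"
    # Enable the middleware for this spider only.
    custom_settings = {
        'DOWNLOADER_MIDDLEWARES': {
            'project_root_path.MyMiddlewaresFile.MyMiddleware': 300,
        }
    }

    def start_requests(self):
        # Placeholder URL; replace it with the pages you want to crawl.
        yield scrapy.Request("https://example.com/")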
In your MyMiddlewaresFile.py:
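from scrapy.exceptions import IgnoreRequest


class MyMiddleware(object):
    def process_request(self, request, spider):
        if "/en/" in request.url:
            # Drop the request: it is neither downloaded nor passed on.
            raise IgnoreRequest()
        # Returning None lets Scrapy keep processing the request.
        # Returning the request itself would reschedule it instead.
        return None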
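
The substring test `"/en/" in request.url` also matches when `/en/` appears only in the query string, and it misses a path that ends in `/en` without a trailing slash. A stricter check could compare whole path segments instead. The following is a minimal sketch that uses only the standard library, so it can be tried outside Scrapy; the helper name `has_path_segment` and the example.com URLs are hypothetical and not part of the original answer.

```python
from urllib.parse import urlsplit


def has_path_segment(url: str, segment: str) -> bool:
    """Return True if `segment` is one whole component of the URL path."""
    # urlsplit separates the path from the query string and fragment,
    # so text after "?" or "#" is never inspected.
    path = urlsplit(url).path
    return segment in path.split("/")


if __name__ == "__main__":
    for url in (
        "https://example.com/en/news",
        "https://example.com/fr/news?next=/en/",
    ):
        print(url, has_path_segment(url, "en"))
```

Inside `process_request`, the condition could then read `if has_path_segment(request.url, "en"):`, assuming the helper lives in the same module as the middleware.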
Register the middleware in the spider's custom_settings as shown above, and define the middleware class in MyMiddlewaresFile.py.
See the documentation: https://doc.scrapy.org/en/latest/topics/downloader-middleware.html#scrapy.downloadermiddlewares.DownloaderMiddleware.process_request