CONCURRENT_REQUESTS- The maximum number of concurrent (ie.
simultaneous) requests that will be performed by the Scrapy
downloader.
CONCURRENT_REQUESTS_PER_DOMAIN - The maximum number of concurrent
(ie. simultaneous) requests that will be performed to any single
domain.
CONCURRENT_REQUESTS_PER_IP - The maximum number of concurrent (ie.
simultaneous) requests that will be performed to any single IP. If
non-zero, the CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and
this one is used instead. In other words, concurrency limits will be
applied per IP, not per domain.
直接从文件中:
直接回答你的问题
我怀疑该服务只允许您收集最多20个线程,这意味着它不关心您请求什么,所以您应该使用
CONCURRENT_REQUESTS
设置为最大20个线程(默认值为16)。在每个请求都是“某种线程”。它建立在Twisted之上。在你所使用的代理服务看来,没有办法区分两者的区别,所以每个请求都将是一个代理线程!在
相关问题 更多 >
编程相关推荐