在多线程代理中使用

1条回答

网友

1楼 · 发布于 2024-05-12 13:58:03

直接从文件中：

CONCURRENT_REQUESTS- The maximum number of concurrent (ie. simultaneous) requests that will be performed by the Scrapy downloader.
CONCURRENT_REQUESTS_PER_DOMAIN - The maximum number of concurrent (ie. simultaneous) requests that will be performed to any single domain.
CONCURRENT_REQUESTS_PER_IP - The maximum number of concurrent (ie. simultaneous) requests that will be performed to any single IP. If non-zero, the CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and this one is used instead. In other words, concurrency limits will be applied per IP, not per domain.

直接回答你的问题

我怀疑该服务只允许您收集最多20个线程，这意味着它不关心您请求什么，所以您应该使用CONCURRENT_REQUESTS设置为最大20个线程（默认值为16）。在

每个请求都是“某种线程”。它建立在Twisted之上。在你所使用的代理服务看来，没有办法区分两者的区别，所以每个请求都将是一个代理线程！在

相关问题更多 >

编程相关推荐

热门问题

热门文章

在多线程代理中使用

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >