How can I effectively use multiprocessing for requests in a Python Tornado server?
I am using the non-blocking Python server Tornado. I have a class of GET requests that may take a significant amount of time to complete (think in the range of 5-10 seconds). The problem is that Tornado blocks on these requests, so that subsequent fast requests are held up until the slow request completes.
I looked at https://github.com/facebook/tornado/wiki/Threading-and-concurrency and concluded that I want some combination of #3 (other processes) and #4 (other threads). #4 on its own had problems: I was unable to reliably hand control back to the ioloop while another thread was doing the "heavy lifting". (I assume this is due to the GIL and the heavy-lifting task being CPU-bound, pulling control away from the main ioloop, but that is just a guess.)
So I have been prototyping a solution that does the "heavy lifting" for these slow GET requests in a separate process, and then places a callback back into the Tornado ioloop when the process is done, to finish the request. This frees up the ioloop to handle other requests.
I have created a simple example demonstrating a possible solution, but I am curious to get feedback from the community on it.
My question is two-fold: how can this current approach be simplified, and what pitfalls potentially exist with it?
The approach
1. Utilize Tornado's built-in asynchronous decorator, which allows a request to stay open while the ioloop continues running.
2. Spawn a separate process for the "heavy lifting" task using Python's multiprocessing module. I first attempted to use the threading module, but was unable to reliably relinquish control back to the ioloop. It also appears that multiprocessing can take advantage of multiple cores.
3. Start a "watcher" thread in the main ioloop process, using the threading module, whose job is to watch a multiprocessing.Queue for the result of the "heavy lifting" task when it completes. This is needed because I need a way to know that the heavy-lifting task has finished while still being able to notify the ioloop that this request is now done.
4. Make sure the "watcher" thread relinquishes control to the main ioloop often, with time.sleep(0) calls, so that other requests keep getting processed promptly.
5. When there is a result in the queue, add a callback from the "watcher" thread using tornado.ioloop.IOLoop.instance().add_callback(), which is documented as the only safe way to call the ioloop instance from another thread.
6. Be sure to then call finish() in the callback, to complete the request and send the reply.
Below is some sample code showing this approach. multi_tornado.py is the server implementing the outline above, and call_multi.py is a sample script that calls the server in two different ways to test it. Both tests call the server with 3 slow GET requests followed by 20 fast GET requests. The results are shown both with and without the threading turned on.
In the case of running with no threading, the 3 slow requests block (each taking a little over a second to complete). A few of the fast requests squeeze in between some of the slow ones (I am not totally sure how that happens - it could be an artifact of running both the server and the client test script on the same machine). The point here is that all of the fast requests are held up to varying degrees.
In the case of running with threading enabled, the 20 fast requests all complete immediately, and the three slow requests complete at about the same time afterwards, since they run in parallel. This is the desired behavior. The three slow requests take 2.5 seconds to complete in parallel, whereas in the non-threaded case they take about 3.5 seconds in total. So there is about a 35% speedup overall (I assume due to multicore sharing). But more importantly, the fast requests are handled immediately instead of being held up by the slow ones.
I do not have a lot of experience with multithreaded programming - so while this seemingly works, I am curious to learn: is there a simpler way to accomplish this, and what problems may lurk within this approach?
(Note: a future tradeoff may be to just run more instances of Tornado with a reverse proxy like nginx doing load balancing. No matter what, I will be running multiple instances behind a load balancer - but I am wary of just throwing hardware at this problem, since the hardware seems so directly coupled to the blocking issue.)
Sample code
multi_tornado.py (the sample server):
import time
import threading
import multiprocessing
import math
from tornado.web import RequestHandler, Application, asynchronous
from tornado.ioloop import IOLoop
# run in some other process - put result in q
def heavy_lifting(q):
t0 = time.time()
for k in range(2000):
math.factorial(k)
t = time.time()
q.put(t - t0) # report time to compute in queue
class FastHandler(RequestHandler):
def get(self):
res = 'fast result ' + self.get_argument('id')
print res
self.write(res)
self.flush()
class MultiThreadedHandler(RequestHandler):
# Note: This handler can be called with threaded = True or False
def initialize(self, threaded=True):
self._threaded = threaded
self._q = multiprocessing.Queue()
def start_process(self, worker, callback):
# method to start process and watcher thread
self._callback = callback
if self._threaded:
# launch process
multiprocessing.Process(target=worker, args=(self._q,)).start()
# start watching for process to finish
threading.Thread(target=self._watcher).start()
else:
# threaded = False just call directly and block
worker(self._q)
self._watcher()
def _watcher(self):
# watches the queue for process result
while self._q.empty():
time.sleep(0) # relinquish control if not ready
# put callback back into the ioloop so we can finish request
response = self._q.get(False)
IOLoop.instance().add_callback(lambda: self._callback(response))
class SlowHandler(MultiThreadedHandler):
@asynchronous
def get(self):
        # start the heavy-lifting process and a thread to watch for its result
self.start_process(heavy_lifting, self._on_response)
def _on_response(self, delta):
_id = self.get_argument('id')
res = 'slow result {} <--- {:0.3f} s'.format(_id, delta)
print res
self.write(res)
self.flush()
self.finish() # be sure to finish request
application = Application([
(r"/fast", FastHandler),
(r"/slow", SlowHandler, dict(threaded=False)),
(r"/slow_threaded", SlowHandler, dict(threaded=True)),
])
if __name__ == "__main__":
application.listen(8888)
IOLoop.instance().start()
call_multi.py (the client tester):
import sys
from tornado.ioloop import IOLoop
from tornado import httpclient
def run(slow):
def show_response(res):
print res.body
# make 3 "slow" requests on server
requests = []
for k in xrange(3):
uri = 'http://localhost:8888/{}?id={}'
requests.append(uri.format(slow, str(k + 1)))
# followed by 20 "fast" requests
for k in xrange(20):
uri = 'http://localhost:8888/fast?id={}'
requests.append(uri.format(k + 1))
# show results as they return
http_client = httpclient.AsyncHTTPClient()
print 'Scheduling Get Requests:'
print '------------------------'
for req in requests:
print req
http_client.fetch(req, show_response)
# execute requests on server
print '\nStart sending requests....'
IOLoop.instance().start()
if __name__ == '__main__':
scenario = sys.argv[1]
if scenario == 'slow' or scenario == 'slow_threaded':
run(scenario)
Test results
Running python call_multi.py slow (the blocking behavior):
Scheduling Get Requests:
------------------------
http://localhost:8888/slow?id=1
http://localhost:8888/slow?id=2
http://localhost:8888/slow?id=3
http://localhost:8888/fast?id=1
http://localhost:8888/fast?id=2
http://localhost:8888/fast?id=3
http://localhost:8888/fast?id=4
http://localhost:8888/fast?id=5
http://localhost:8888/fast?id=6
http://localhost:8888/fast?id=7
http://localhost:8888/fast?id=8
http://localhost:8888/fast?id=9
http://localhost:8888/fast?id=10
http://localhost:8888/fast?id=11
http://localhost:8888/fast?id=12
http://localhost:8888/fast?id=13
http://localhost:8888/fast?id=14
http://localhost:8888/fast?id=15
http://localhost:8888/fast?id=16
http://localhost:8888/fast?id=17
http://localhost:8888/fast?id=18
http://localhost:8888/fast?id=19
http://localhost:8888/fast?id=20
Start sending requests....
slow result 1 <--- 1.338 s
fast result 1
fast result 2
fast result 3
fast result 4
fast result 5
fast result 6
fast result 7
slow result 2 <--- 1.169 s
slow result 3 <--- 1.130 s
fast result 8
fast result 9
fast result 10
fast result 11
fast result 13
fast result 12
fast result 14
fast result 15
fast result 16
fast result 18
fast result 17
fast result 19
fast result 20
Running python call_multi.py slow_threaded (the desired behavior):
Scheduling Get Requests:
------------------------
http://localhost:8888/slow_threaded?id=1
http://localhost:8888/slow_threaded?id=2
http://localhost:8888/slow_threaded?id=3
http://localhost:8888/fast?id=1
http://localhost:8888/fast?id=2
http://localhost:8888/fast?id=3
http://localhost:8888/fast?id=4
http://localhost:8888/fast?id=5
http://localhost:8888/fast?id=6
http://localhost:8888/fast?id=7
http://localhost:8888/fast?id=8
http://localhost:8888/fast?id=9
http://localhost:8888/fast?id=10
http://localhost:8888/fast?id=11
http://localhost:8888/fast?id=12
http://localhost:8888/fast?id=13
http://localhost:8888/fast?id=14
http://localhost:8888/fast?id=15
http://localhost:8888/fast?id=16
http://localhost:8888/fast?id=17
http://localhost:8888/fast?id=18
http://localhost:8888/fast?id=19
http://localhost:8888/fast?id=20
Start sending requests....
fast result 1
fast result 2
fast result 3
fast result 4
fast result 5
fast result 6
fast result 7
fast result 8
fast result 9
fast result 10
fast result 11
fast result 12
fast result 13
fast result 14
fast result 15
fast result 19
fast result 20
fast result 17
fast result 16
fast result 18
slow result 2 <--- 2.485 s
slow result 3 <--- 2.491 s
slow result 1 <--- 2.517 s
3 Answers
If your GET requests are taking that long, then Tornado is the wrong framework for the job.
I suggest you use nginx to route the fast GETs to Tornado and the slower ones to a different server.
PeterBe has an interesting article where he runs multiple Tornado servers and sets one of them up as a "slow server" for the long-running requests. See worrying-about-io-blocking - I would try this approach.
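For illustration only, here is a minimal sketch of the multiple-instances idea using Tornado's built-in pre-fork support. It assumes the application object from the question is importable; the port and the one-process-per-core choice are arbitrary, and nginx would still do routing or balancing in front:
import tornado.httpserver
from tornado.ioloop import IOLoop

from multi_tornado import application  # the Application from the question (assumed importable)

if __name__ == "__main__":
    server = tornado.httpserver.HTTPServer(application)
    server.bind(8888)
    server.start(0)  # 0 means: fork one worker process per CPU core
    IOLoop.instance().start()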
multiprocessing.Pool can be used with the tornado I/O loop, but it's a bit messy. It would be cleaner to use concurrent.futures (see my other answer for details), but if you're stuck on Python 2.x and can't install the concurrent.futures backport, here is how you can do it using multiprocessing only:
The multiprocessing.Pool.apply_async and multiprocessing.Pool.map_async methods both have an optional callback parameter, which means that both can potentially be used with tornado.gen.Task. So in most cases, running code asynchronously in a sub-process is as simple as this:
import multiprocessing
import contextlib
from tornado import gen
from tornado.gen import Return
from tornado.ioloop import IOLoop
from functools import partial
def worker():
print "async work here"
@gen.coroutine
def async_run(func, *args, **kwargs):
result = yield gen.Task(pool.apply_async, func, args, kwargs)
raise Return(result)
if __name__ == "__main__":
pool = multiprocessing.Pool(multiprocessing.cpu_count())
func = partial(async_run, worker)
IOLoop().run_sync(func)
This works well in most cases, as I mentioned. But if worker() throws an exception, callback never gets called, which means the gen.Task never finishes and your program hangs forever. If you know your work will absolutely never throw an exception (for example, because you wrapped the whole thing in a try/except), you can happily use this approach. However, if you want to let exceptions escape from your worker, the only solution I found is to subclass some of the multiprocessing components and make them call callback even if the worker sub-process raised an exception:
from multiprocessing.pool import ApplyResult, Pool, RUN
import multiprocessing
class TornadoApplyResult(ApplyResult):
def _set(self, i, obj):
self._success, self._value = obj
if self._callback:
self._callback(self._value)
self._cond.acquire()
try:
self._ready = True
self._cond.notify()
finally:
self._cond.release()
del self._cache[self._job]
class TornadoPool(Pool):
def apply_async(self, func, args=(), kwds={}, callback=None):
''' Asynchronous equivalent of `apply()` builtin
This version will call `callback` even if an exception is
raised by `func`.
'''
assert self._state == RUN
result = TornadoApplyResult(self._cache, callback)
self._taskqueue.put(([(result._job, None, func, args, kwds)], None))
return result
...
if __name__ == "__main__":
pool = TornadoPool(multiprocessing.cpu_count())
...
With these changes, the exception object will be returned by gen.Task instead of gen.Task hanging indefinitely. I also updated my async_run method to re-raise the exception when it is returned, and made some other changes to provide better tracebacks for exceptions thrown in the worker sub-processes. Here's the full code:
import sys
import time
import traceback
import multiprocessing
from multiprocessing.pool import Pool, ApplyResult, RUN
from functools import wraps
import tornado.web
from tornado.ioloop import IOLoop
from tornado.gen import Return
from tornado import gen
class WrapException(Exception):
def __init__(self):
exc_type, exc_value, exc_tb = sys.exc_info()
self.exception = exc_value
self.formatted = ''.join(traceback.format_exception(exc_type, exc_value, exc_tb))
def __str__(self):
return '\n%s\nOriginal traceback:\n%s' % (Exception.__str__(self), self.formatted)
class TornadoApplyResult(ApplyResult):
def _set(self, i, obj):
self._success, self._value = obj
if self._callback:
self._callback(self._value)
self._cond.acquire()
try:
self._ready = True
self._cond.notify()
finally:
self._cond.release()
del self._cache[self._job]
class TornadoPool(Pool):
def apply_async(self, func, args=(), kwds={}, callback=None):
''' Asynchronous equivalent of `apply()` builtin
This version will call `callback` even if an exception is
raised by `func`.
'''
assert self._state == RUN
result = TornadoApplyResult(self._cache, callback)
self._taskqueue.put(([(result._job, None, func, args, kwds)], None))
return result
@gen.coroutine
def async_run(func, *args, **kwargs):
""" Runs the given function in a subprocess.
This wraps the given function in a gen.Task and runs it
in a multiprocessing.Pool. It is meant to be used as a
Tornado co-routine. Note that if func returns an Exception
(or an Exception sub-class), this function will raise the
Exception, rather than return it.
"""
result = yield gen.Task(pool.apply_async, func, args, kwargs)
if isinstance(result, Exception):
raise result
raise Return(result)
def handle_exceptions(func):
""" Raise a WrapException so we get a more meaningful traceback"""
@wraps(func)
def inner(*args, **kwargs):
try:
return func(*args, **kwargs)
except Exception:
raise WrapException()
return inner
# Test worker functions
@handle_exceptions
def test2(x):
raise Exception("eeee")
@handle_exceptions
def test(x):
print x
time.sleep(2)
return "done"
class TestHandler(tornado.web.RequestHandler):
@gen.coroutine
def get(self):
try:
result = yield async_run(test, "inside get")
self.write("%s\n" % result)
result = yield async_run(test2, "hi2")
except Exception as e:
print("caught exception in get")
self.write("Caught an exception: %s" % e)
finally:
self.finish()
app = tornado.web.Application([
(r"/test", TestHandler),
])
if __name__ == "__main__":
pool = TornadoPool(4)
app.listen(8888)
IOLoop.instance().start()
Here's how it behaves for the client:
dan@dan:~$ curl localhost:8888/test
done
Caught an exception:
Original traceback:
Traceback (most recent call last):
File "./mutli.py", line 123, in inner
return func(*args, **kwargs)
File "./mutli.py", line 131, in test2
raise Exception("eeee")
Exception: eeee
And if I send two curl requests at the same time, we can see that they are handled asynchronously on the server side:
dan@dan:~$ ./mutli.py
inside get
inside get
caught exception inside get
caught exception inside get
Edit:
Keep in mind that this code becomes simpler with Python 3, because it introduces an error_callback keyword argument for all of the asynchronous multiprocessing.Pool methods. This makes integrating with Tornado much easier:
class TornadoPool(Pool):
def apply_async(self, func, args=(), kwds={}, callback=None):
''' Asynchronous equivalent of `apply()` builtin
This version will call `callback` even if an exception is
raised by `func`.
'''
super().apply_async(func, args, kwds, callback=callback,
error_callback=callback)
@gen.coroutine
def async_run(func, *args, **kwargs):
""" Runs the given function in a subprocess.
This wraps the given function in a gen.Task and runs it
in a multiprocessing.Pool. It is meant to be used as a
Tornado co-routine. Note that if func returns an Exception
(or an Exception sub-class), this function will raise the
Exception, rather than return it.
"""
    result = yield gen.Task(pool.apply_async, func, args, kwargs)
    if isinstance(result, Exception):
        raise result
    raise Return(result)
All we need to do in our overridden apply_async is call the parent with the error_callback keyword argument in addition to the callback argument. There is no need to override ApplyResult at all.
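For completeness, here is a minimal driver for this Python 3 version, analogous to the __main__ block in the Python 2 example above. The worker function is just a placeholder, and IOLoop and async_run are reused from the snippets above:
import multiprocessing
from functools import partial

# Placeholder worker; any picklable module-level function will do,
# and it may raise - the exception will be re-raised by async_run.
def worker():
    return "done"

if __name__ == "__main__":
    pool = TornadoPool(multiprocessing.cpu_count())
    print(IOLoop.instance().run_sync(partial(async_run, worker)))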
We can get even fancier by using a metaclass on TornadoPool, to allow its *_async methods to be called directly as if they were coroutines:
import time
from functools import wraps
from multiprocessing.pool import Pool
import tornado.web
from tornado import gen
from tornado.gen import Return, Arguments  # Arguments: the namedtuple tornado uses for multi-value callbacks
from tornado import stack_context
from tornado.ioloop import IOLoop
from tornado.concurrent import Future
def _argument_adapter(callback):
def wrapper(*args, **kwargs):
if kwargs or len(args) > 1:
callback(Arguments(args, kwargs))
elif args:
callback(args[0])
else:
callback(None)
return wrapper
def PoolTask(func, *args, **kwargs):
""" Task function for use with multiprocessing.Pool methods.
This is very similar to tornado.gen.Task, except it sets the
error_callback kwarg in addition to the callback kwarg. This
way exceptions raised in pool worker methods get raised in the
parent when the Task is yielded from.
"""
future = Future()
def handle_exception(typ, value, tb):
if future.done():
return False
future.set_exc_info((typ, value, tb))
return True
def set_result(result):
if future.done():
return
if isinstance(result, Exception):
future.set_exception(result)
else:
future.set_result(result)
with stack_context.ExceptionStackContext(handle_exception):
cb = _argument_adapter(set_result)
func(*args, callback=cb, error_callback=cb)
return future
def coro_runner(func):
""" Wraps the given func in a PoolTask and returns it. """
@wraps(func)
def wrapper(*args, **kwargs):
return PoolTask(func, *args, **kwargs)
return wrapper
class MetaPool(type):
""" Wrap all *_async methods in Pool with coro_runner. """
def __new__(cls, clsname, bases, dct):
pdct = bases[0].__dict__
for attr in pdct:
if attr.endswith("async") and not attr.startswith('_'):
setattr(bases[0], attr, coro_runner(pdct[attr]))
return super().__new__(cls, clsname, bases, dct)
class TornadoPool(Pool, metaclass=MetaPool):
pass
# Test worker functions
def test2(x):
print("hi2")
raise Exception("eeee")
def test(x):
print(x)
time.sleep(2)
return "done"
class TestHandler(tornado.web.RequestHandler):
@gen.coroutine
def get(self):
try:
result = yield pool.apply_async(test, ("inside get",))
self.write("%s\n" % result)
result = yield pool.apply_async(test2, ("hi2",))
self.write("%s\n" % result)
except Exception as e:
print("caught exception in get")
self.write("Caught an exception: %s" % e)
raise
finally:
self.finish()
app = tornado.web.Application([
(r"/test", TestHandler),
])
if __name__ == "__main__":
pool = TornadoPool()
app.listen(8888)
IOLoop.instance().start()
If you're willing to use concurrent.futures.ProcessPoolExecutor instead of multiprocessing, this is actually very simple. Tornado's ioloop already supports concurrent.futures.Future, so they play nicely together out of the box. concurrent.futures is included with Python 3.2+, and has also been backported to Python 2.x (as the futures package on PyPI).
Here's an example:
import time
from concurrent.futures import ProcessPoolExecutor
from tornado.ioloop import IOLoop
from tornado import gen
def f(a, b, c, blah=None):
print "got %s %s %s and %s" % (a, b, c, blah)
time.sleep(5)
return "hey there"
@gen.coroutine
def test_it():
pool = ProcessPoolExecutor(max_workers=1)
fut = pool.submit(f, 1, 2, 3, blah="ok") # This returns a concurrent.futures.Future
print("running it asynchronously")
ret = yield fut
print("it returned %s" % ret)
pool.shutdown()
IOLoop.instance().run_sync(test_it)
Output:
running it asynchronously
got 1 2 3 and ok
it returned hey there
ProcessPoolExecutor has a more limited API than multiprocessing.Pool, but if you don't need the more advanced features of multiprocessing.Pool, it is worth using because the integration is that much simpler.
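To connect this back to the original question, the same pattern can drive a request handler directly. The following is only a rough sketch under a few assumptions: a module-level heavy_lifting function standing in for the real work, a shared module-level pool, and a Tornado version whose coroutines accept a concurrent.futures.Future yielded directly (as the example above relies on):
import math
from concurrent.futures import ProcessPoolExecutor

from tornado import gen
from tornado.ioloop import IOLoop
from tornado.web import RequestHandler, Application

# Must be module-level so it can be pickled and sent to a worker process.
def heavy_lifting(n):
    return math.factorial(n)

pool = ProcessPoolExecutor(max_workers=4)

class SlowHandler(RequestHandler):
    @gen.coroutine
    def get(self):
        # pool.submit() returns a concurrent.futures.Future; yielding it
        # suspends this handler while the ioloop keeps serving other
        # requests, then resumes with the worker's return value.
        result = yield pool.submit(heavy_lifting, 2000)
        self.write("result has %d digits" % len(str(result)))

if __name__ == "__main__":
    Application([(r"/slow", SlowHandler)]).listen(8888)
    IOLoop.instance().start()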