python concurrent.futures.ProcessPoolExecutor:submit（）对map（）的性能问题的回答

python concurrent.futures.ProcessPoolExecutor:submit（）对map（）的性能

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

我正在使用<code>concurrent.futures.ProcessPoolExecutor</code>从数字范围中查找数字的出现。其目的是研究从并发中获得的加速性能的数量。为了测试性能，我有一个控件-一个串行代码来执行上述任务（如下所示）。我已经编写了两个并发代码，一个使用<code>concurrent.futures.ProcessPoolExecutor.submit()</code>，另一个使用<code>concurrent.futures.ProcessPoolExecutor.map()</code>来执行相同的任务。它们如下所示。关于起草前者和后者的建议分别见<a href="https://stackoverflow.com/q/42049066/5722359">here</a>和<a href="https://stackoverflow.com/q/42056738/5722359">here</a>。 向这三个代码发出的任务是查找数字5在0到1E8的数字范围内出现的次数。将<code>.submit()</code>和<code>.map()</code>分配给6名工人，并且<code>.map()</code>的块大小为10000。在并发代码中，工作负载离散化的方式是相同的。但是，用于在两个代码中查找匹配项的函数是不同的。这是因为参数传递给<code>.submit()</code>和<code>.map()</code>调用的函数的方式不同。 所有3个代码的报告次数相同，即56953279次。然而，完成这项任务所花的时间却大不相同。<code>.submit()</code>的执行速度是对照组的2倍，而{<cd5>}完成任务的时间是对照组的2倍。 问题： <ol> <li>我想知道<code>.map()</code>的慢性能是我编写的一个工件，还是它本身就是慢的？”如果是前者，我该如何改进。我只是很惊讶，它的表现慢于控制，因为没有太多的动机来使用它。</li> <li>我想知道是否还有什么可以让<code>.submit()</code>代码执行得更快。我的条件是函数<code>_concurrent_submit()</code>必须返回一个iterable，其中包含数字5的数字/出现次数。</li> </ol> 基准结果 <a href="https://i.stack.imgur.com/3x3v2.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/3x3v2.png" alt="benchmark results"/></a> concurrent.futures.ProcessPoolExecutor.submit（） <pre><code>#!/usr/bin/python3.5 # -*- coding: utf-8 -*- import concurrent.futures as cf from time import time from traceback import print_exc def _findmatch(nmin, nmax, number): '''Function to find the occurrence of number in range nmin to nmax and return the found occurrences in a list.''' print('\n def _findmatch', nmin, nmax, number) start = time() match=[] for n in range(nmin, nmax): if number in str(n): match.<a href="https://www.cnpython.com/list/append" class="inner-link">append</a>(n) end = time() - start print("found {0} in {1:.4f}sec".format(len(match),end)) return match def _concurrent_submit(nmax, number, workers): '''Function that utilises concurrent.futures.ProcessPoolExecutor.submit to find the occurences of a given number in a number range in a parallelised manner.''' # 1. Local variables start = time() chunk = nmax // workers futures = [] found =[] #2. Parallelization with cf.ProcessPoolExecutor(max_workers=workers) as executor: # 2.1. Discretise workload and submit to worker pool for i in range(workers): cstart = chunk * i cstop = chunk * (i + 1) if i != workers - 1 else nmax futures.append(executor.submit(_findmatch, cstart, cstop, number)) # 2.2. Instruct workers to process results as they come, when all are # completed or ..... cf.as_completed(futures) # faster than cf.wait() # 2.3. Consolidate result as a list and return this list. for future in futures: for f in future.result(): try: found.append(f) except: print_exc() foundsize = len(found) end = time() - start print('within statement of def _concurrent_submit():') print("found {0} in {1:.4f}sec".format(foundsize, end)) return found if __name__ == '__main__': nmax = int(1E8) # Number range maximum. number = str(5) # Number to be found in number range. workers = 6 # Pool of workers start = time() a = _concurrent_submit(nmax, number, workers) end = time() - start print('\n main') print('workers = ', workers) print("found {0} in {1:.4f}sec".format(len(a),end)) </code></pre> concurrent.futures.ProcessPoolExecutor.map（） <pre><code>#!/usr/bin/python3.5 # -*- coding: utf-8 -*- import concurrent.futures as cf import itertools from time import time from traceback import print_exc def _findmatch(listnumber, number): '''Function to find the occurrence of number in another number and return a string value.''' #print('def _findmatch(listnumber, number):') #print('listnumber = {0} and ref = {1}'.format(listnumber, number)) if number in str(listnumber): x = listnumber #print('x = {0}'.format(x)) return x def _concurrent_map(nmax, number, workers): '''Function that utilises concurrent.futures.ProcessPoolExecutor.map to find the occurrences of a given number in a number range in a parallelised manner.''' # 1. Local variables start = time() chunk = nmax // workers futures = [] found =[] #2. Parallelization with cf.ProcessPoolExecutor(max_workers=workers) as executor: # 2.1. Discretise workload and submit to worker pool for i in range(workers): cstart = chunk * i cstop = chunk * (i + 1) if i != workers - 1 else nmax numberlist = range(cstart, cstop) futures.append(executor.map(_findmatch, numberlist, itertools.repeat(number), chunksize=10000)) # 2.3. Consolidate result as a list and return this list. for future in futures: for f in future: if f: try: found.append(f) except: print_exc() foundsize = len(found) end = time() - start print('within statement of def _concurrent(nmax, number):') print("found {0} in {1:.4f}sec".format(foundsize, end)) return found if __name__ == '__main__': nmax = int(1E8) # Number range maximum. number = str(5) # Number to be found in number range. workers = 6 # Pool of workers start = time() a = _concurrent_map(nmax, number, workers) end = time() - start print('\n main') print('workers = ', workers) print("found {0} in {1:.4f}sec".format(len(a),end)) </code></pre> 序列号： <pre><code>#!/usr/bin/python3.5 # -*- coding: utf-8 -*- from time import time def _serial(nmax, number): start = time() match=[] nlist = range(nmax) for n in nlist: if number in str(n):match.append(n) end=time()-start print("found {0} in {1:.4f}sec".format(len(match),end)) return match if __name__ == '__main__': nmax = int(1E8) # Number range maximum. number = str(5) # Number to be found in number range. start = time() a = _serial(nmax, number) end = time() - start print('\n main') print("found {0} in {1:.4f}sec".format(len(a),end)) </code></pre> 2017年2月13日更新： 除了@niemmi answer，我还提供了一个答案，下面是一些个人研究： <ol> <li>如何进一步加速@niemmi的<code>.map()</code>和<code>.submit()</code>解决方案，以及</li> <li>当<code>ProcessPoolExecutor.map()</code>可以导致比<code>ProcessPoolExecutor.submit()</code>更快的速度时。</li> </ol>

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

你在拿苹果和桔子作比较。当使用<code>map</code>时，您将生成所有的<code>1E8</code>数字并将它们传输到工作进程。与实际执行相比，这需要很多时间。当使用<code>submit</code>时，只需创建6组被传输的参数。 如果您将<code>map</code>更改为使用相同的原理操作，您将得到彼此接近的数字： <pre><code>def _findmatch(nmin, nmax, number): '''Function to find the occurrence of number in range nmin to nmax and return the found occurrences in a list.''' print('\n def _findmatch', nmin, nmax, number) start = time() match=[] for n in range(nmin, nmax): if number in str(n): match.append(n) end = time() - start print("found {0} in {1:.4f}sec".format(len(match),end)) return match def _concurrent_map(nmax, number, workers): '''Function that utilises concurrent.futures.ProcessPoolExecutor.map to find the occurrences of a given number in a number range in a parallelised manner.''' # 1. Local variables start = time() chunk = nmax // workers futures = [] found =[] #2. Parallelization with cf.ProcessPoolExecutor(max_workers=workers) as executor: # 2.1. Discretise workload and submit to worker pool cstart = (chunk * i for i in range(workers)) cstop = (chunk * i if i != workers else nmax for i in range(1, workers + 1)) futures = executor.map(_findmatch, cstart, cstop, itertools.repeat(number)) # 2.3. Consolidate result as a list and return this list. for future in futures: for f in future: try: found.append(f) except: print_exc() foundsize = len(found) end = time() - start print('within statement of def _concurrent(nmax, number):') print("found {0} in {1:.4f}sec".format(foundsize, end)) return found </code></pre> 正确使用<a href="https://docs.python.org/3/library/concurrent.futures.html#concurrent.futures.as_completed" rel="nofollow noreferrer">^{<cd5>}</a>可以提高submit的性能。对于给定的iterable of futures，它将返回一个迭代器，该迭代器将<code>yield</code>futures按照它们完成的顺序。 您还可以跳过将数据复制到另一个数组，并使用<a href="https://docs.python.org/3/library/itertools.html#itertools.chain.from_iterable" rel="nofollow noreferrer">^{<cd7>}</a>将来自未来的结果组合到单个iterable： <pre><code>import concurrent.futures as cf import itertools from time import time from traceback import print_exc from itertools import chain def _findmatch(nmin, nmax, number): '''Function to find the occurrence of number in range nmin to nmax and return the found occurrences in a list.''' print('\n def _findmatch', nmin, nmax, number) start = time() match=[] for n in range(nmin, nmax): if number in str(n): match.append(n) end = time() - start print("found {0} in {1:.4f}sec".format(len(match),end)) return match def _concurrent_map(nmax, number, workers): '''Function that utilises concurrent.futures.ProcessPoolExecutor.map to find the occurrences of a given number in a number range in a parallelised manner.''' # 1. Local variables chunk = nmax // workers futures = [] found =[] #2. Parallelization with cf.ProcessPoolExecutor(max_workers=workers) as executor: # 2.1. Discretise workload and submit to worker pool for i in range(workers): cstart = chunk * i cstop = chunk * (i + 1) if i != workers - 1 else nmax futures.append(executor.submit(_findmatch, cstart, cstop, number)) return chain.from_iterable(f.result() for f in cf.as_completed(futures)) if __name__ == '__main__': nmax = int(1E8) # Number range maximum. number = str(5) # Number to be found in number range. workers = 6 # Pool of workers start = time() a = _concurrent_map(nmax, number, workers) end = time() - start print('\n main') print('workers = ', workers) print("found {0} in {1:.4f}sec".format(sum(1 for x in a),end)) </code></pre>

python concurrent.futures.ProcessPoolExecutor:submit（）对map（）的性能

1 个回答

相关Python问题