子流程的性能。检查输出与子流程。

2条回答

网友

1楼 · 编辑于 2024-04-27 05:25:22

阅读文档时，subprocess.call和subprocess.check_output都是subprocess.Popen的用例。一个小的区别是，如果子进程返回非零退出状态，check_output将引发Python错误。关于check_output（我的重点）的部分强调了更大的差异：

The full function signature is largely the same as that of the Popen constructor, except that stdout is not permitted as it is used internally. All other supplied arguments are passed directly through to the Popen constructor.

那么stdout是如何“内部使用”的呢？让我们比较一下call和check_output：

呼叫

def call(*popenargs, **kwargs):
    return Popen(*popenargs, **kwargs).wait()

检查输出

def check_output(*popenargs, **kwargs):
    if 'stdout' in kwargs:
        raise ValueError('stdout argument not allowed, it will be overridden.')
    process = Popen(stdout=PIPE, *popenargs, **kwargs)
    output, unused_err = process.communicate()
    retcode = process.poll()
    if retcode:
        cmd = kwargs.get("args")
        if cmd is None:
            cmd = popenargs[0]
        raise CalledProcessError(retcode, cmd, output=output)
    return output

沟通

现在我们还要看Popen.communicate。这样做，我们注意到对于一个管道，communicate所做的几件事比像call那样简单地返回Popen().wait()所花费的时间要长。

首先，communicate处理stdout=PIPE，不管您是否设置了shell=True。显然，call没有。它只会让你的壳喷出任何东西。。。使之成为一种安全风险，as Python describes here。

其次，在check_output(cmd, shell=True)（只有一个管道）的情况下。。。子进程发送到stdout的任何内容都由_communicate方法中的线程处理。并且Popen必须加入线程（等待它），然后再等待子进程本身终止！

另外，更简单的是，它将stdout处理为list，然后必须将其连接到字符串中。

简而言之，即使参数最小，check_output在Python进程中花费的时间也比call要多得多。

网友

2楼 · 编辑于 2024-04-27 05:25:22

让我们看看代码。.check_输出有以下等待：

    def _internal_poll(self, _deadstate=None, _waitpid=os.waitpid,
            _WNOHANG=os.WNOHANG, _os_error=os.error, _ECHILD=errno.ECHILD):
        """Check if child process has terminated.  Returns returncode
        attribute.

        This method is called by __del__, so it cannot reference anything
        outside of the local scope (nor can any methods it calls).

        """
        if self.returncode is None:
            try:
                pid, sts = _waitpid(self.pid, _WNOHANG)
                if pid == self.pid:
                    self._handle_exitstatus(sts)
            except _os_error as e:
                if _deadstate is not None:
                    self.returncode = _deadstate
                if e.errno == _ECHILD:
                    # This happens if SIGCLD is set to be ignored or
                    # waiting for child processes has otherwise been
                    # disabled for our process.  This child is dead, we
                    # can't get the status.
                    # http://bugs.python.org/issue15756
                    self.returncode = 0
        return self.returncode

.call使用以下代码等待：

    def wait(self):
        """Wait for child process to terminate.  Returns returncode
        attribute."""
        while self.returncode is None:
            try:
                pid, sts = _eintr_retry_call(os.waitpid, self.pid, 0)
            except OSError as e:
                if e.errno != errno.ECHILD:
                    raise
                # This happens if SIGCLD is set to be ignored or waiting
                # for child processes has otherwise been disabled for our
                # process.  This child is dead, we can't get the status.
                pid = self.pid
                sts = 0
            # Check the pid and loop as waitpid has been known to return
            # 0 even without WNOHANG in odd situations.  issue14396.
            if pid == self.pid:
                self._handle_exitstatus(sts)
        return self.returncode

请注意，这个bug与内部轮询有关。可在http://bugs.python.org/issue15756查看。几乎就是你遇到的问题。

编辑：在.call和.check_输出之间的另一个潜在问题是.check_输出实际上关心stdin和stdout，并将尝试对这两个管道执行IO。如果您正在运行的进程本身进入僵尸状态，则可能是对处于失效状态的管道的读取导致了您正在经历的挂起。

在大多数情况下，僵尸状态会很快被清除，但是，如果它们在系统调用（比如读或写）时被中断，它们就不会被清除。当然，一旦IO不能再执行，读/写系统调用本身就应该被中断，但是，有可能您遇到了某种竞争条件，在这种情况下，事情会以错误的顺序被终止。

在这种情况下，我能想到的唯一确定原因的方法是，要么向子进程文件中添加调试代码，要么调用python调试器，并在遇到所遇到的情况时启动回溯。

呼叫

检查输出

沟通

相关问题更多 >

编程相关推荐

热门问题

热门文章