我试图运行一个安装在virtualenv内部的mlflow跟踪服务器,作为Ubuntu 20.04上的systemd服务,但我收到一个错误,表明它找不到gunicorn。这是我的日记
nov 27 10:37:17 Atrium-Power mlflow[81375]: Traceback (most recent call last):
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/bin/mlflow", line 8, in <module>
nov 27 10:37:17 Atrium-Power mlflow[81375]: sys.exit(cli())
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/click/core.py", line 829, in __call__
nov 27 10:37:17 Atrium-Power mlflow[81375]: return self.main(*args, **kwargs)
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/click/core.py", line 782, in main
nov 27 10:37:17 Atrium-Power mlflow[81375]: rv = self.invoke(ctx)
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/click/core.py", line 1259, in invoke
nov 27 10:37:17 Atrium-Power mlflow[81375]: return _process_result(sub_ctx.command.invoke(sub_ctx))
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/click/core.py", line 1066, in invoke
nov 27 10:37:17 Atrium-Power mlflow[81375]: return ctx.invoke(self.callback, **ctx.params)
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/click/core.py", line 610, in invoke
nov 27 10:37:17 Atrium-Power mlflow[81375]: return callback(*args, **kwargs)
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/mlflow/cli.py", line 392, in server
nov 27 10:37:17 Atrium-Power mlflow[81375]: _run_server(
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/mlflow/server/__init__.py", line 138, in _run_server
nov 27 10:37:17 Atrium-Power mlflow[81375]: exec_cmd(full_command, env=env_map, stream_output=True)
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/site-packages/mlflow/utils/process.py", line 34, in exec_cmd
nov 27 10:37:17 Atrium-Power mlflow[81375]: child = subprocess.Popen(
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/subprocess.py", line 947, in __init__
nov 27 10:37:17 Atrium-Power mlflow[81375]: self._execute_child(args, executable, preexec_fn, close_fds,
nov 27 10:37:17 Atrium-Power mlflow[81375]: File "/home/praxasense/.miniconda3/envs/mlflow-server/lib/python3.9/subprocess.py", line 1819, in _execute_child
nov 27 10:37:17 Atrium-Power mlflow[81375]: raise child_exception_type(errno_num, err_msg, err_filename)
nov 27 10:37:17 Atrium-Power mlflow[81375]: FileNotFoundError: [Errno 2] No such file or directory: 'gunicorn'
我的系统是这样的:
[Unit]
StartLimitBurst=5
StartLimitIntervalSec=33
[Service]
User=praxasense
WorkingDirectory=/home/praxasense
Restart=always
RestartSec=5
ExecStart=/home/praxasense/.miniconda3/envs/mlflow-server/bin/mlflow server --port 3569 --backend-store-uri .mlruns
[Install]
WantedBy=multi-user.target
奇怪的是,如果我在我的终端中从ExecStart
运行命令,它在fish shell中可以正常工作,但在bash中不行,但是如果我执行conda activate mlflow-server
,然后执行mlflow ...
,它就可以正常工作。据我所知,Python解释器应该知道它的虚拟环境,因此它应该像我尝试的那样工作,但显然我遗漏了一些东西,使它无法找到gunicon包,它显然就在那里
有什么想法吗
尝试将venv的bin路径添加到systemd运行的环境中:
我还建议设置
KillMode=mixed
,因为如果您以其他方式终止服务,MLFlow将生成不会终止的gunicorn实例mixed
意味着子进程也将被终止相关问题 更多 >
编程相关推荐