我正在将普罗米修斯与我从https://github.com/helm/helm下载的头盔图整合到我的库伯内特斯星系群中。如果你必须知道的话,我正在使用Azure部署我的AKS。在我的每个pod中,容器都运行一个Docker映像,其中包括控制主pod中工作流的master_server.py
脚本
我正试图通过官方的Prometheus Python包master_server.py
从我的主pod中获得一些自定义度量。我的master_server.py
看起来像这样
master_server.py
(截断)
import tornado.ioloop
import tornado.options
import tornado.web
import tornado.websocket
import tornado.gen
import tornado.concurrent
import prometheus_client as prom
num_req = prom.Counter('number_of_request_receive_by_master',
'number of request receive by master')
num_worker = prom.Gauge('number_of_worker_available',
'number of worker available')
def main():
logging.debug('Starting up server')
.
.
.
if __name__ == "__main__":
main()
prom.start_http_server(8081)
我在谷歌上搜索了一个lil,发现我需要添加注释,以便普罗米修斯能够从我的主吊舱中获取数据。因此,在我的deployment.yaml
文件中,我添加了以下代码片段,以允许普罗米修斯从我的主吊舱中刮取数据
template:
metadata:
annotations:
prometheus.io/scrape: 'true'
prometheus.io/port: '8081'
不过,它还是不起作用。我无法在Prometheus查询中看到我的自定义指标
以下是我对主吊舱的deployment.yaml
Name: kaldi-feature-test-master
Namespace: kaldi-test
CreationTimestamp: Fri, 10 Jan 2020 01:53:09 +0800
Labels: app.kubernetes.io/instance=kaldi-feature-test
app.kubernetes.io/managed-by=Tiller
app.kubernetes.io/name=kaldi-feature-test-master
helm.sh/chart=kaldi-feature-test-0.1.0
Annotations: deployment.kubernetes.io/revision: 1
Selector: app.kubernetes.io/instance=kaldi-feature-test,app.kubernetes.io/name=kaldi-feature-test-master
Replicas: 2 desired | 2 updated | 2 total | 2 available | 0 unavailable
StrategyType: RollingUpdate
MinReadySeconds: 0
RollingUpdateStrategy: 25% max unavailable, 25% max surge
Pod Template:
Labels: app.kubernetes.io/instance=kaldi-feature-test
app.kubernetes.io/name=kaldi-feature-test-master
Annotations: prometheus.io/port: 8081
prometheus.io/scrape: true
Containers:
kaldi-feature-test-master:
Image: kalditest.azurecr.io/kalditestscaled:latest
Port: 8080/TCP
Host Port: 0/TCP
Command:
/home/appuser/opt/tini
--
/home/appuser/opt/start_master.sh
Limits:
cpu: 2
memory: 2Gi
Requests:
cpu: 2
memory: 2Gi
Liveness: http-get http://:http/ delay=0s timeout=1s period=10s #success=1 #failure=3
Readiness: http-get http://:http/ delay=0s timeout=1s period=10s #success=1 #failure=3
Environment Variables from:
environment-variables-master-secret Secret Optional: false
Environment: <none>
Mounts: <none>
Volumes: <none>
Conditions:
Type Status Reason
---- ------ ------
Available True MinimumReplicasAvailable
Progressing True NewReplicaSetAvailable
OldReplicaSets: <none>
NewReplicaSet: kaldi-feature-test-master-79886c5d76 (2/2 replicas created)
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal ScalingReplicaSet 15m deployment-controller Scaled up replica set kaldi-feature-test-master-79886c5d76 to 2
我该怎么做才能让普罗米修斯从我的主吊舱中获取自定义指标
是的,多亏了查尔斯的评论,我才得以工作
我在端口8080的主pod中为我的应用程序运行了一个Tornado web server,因此可能会中断Prometheus HTTP服务器,从而从主pod中提取度量
最后,我在主吊舱的
deployment.yaml
中打开了另一个8081端口,如下所示然后在主吊舱中运行的python脚本中,我将Prometheus服务器设置为在端口8081处运行。最后它成功了
prom.start_http_server(8081)
从您提供的Python代码和部署YAML文件可以看出,HTTP服务器侦听端口8081,但您只公开了端口8080,不包括端口8081
因此,解决方案是,您需要在部署的容器
kaldi-feature-test-master
和将请求路由到部署的应用程序的服务中公开端口8081相关问题 更多 >
编程相关推荐