k8s spring 启动 pod 准备就绪和活跃度探测失败

Question

我已经配置了一个 spring-boot pod 并配置了liveness和readiness探针。 当我启动 pod 时， describe命令显示以下 output。

Events:
  Type     Reason     Age                From               Message
  ----     ------     ----               ----               -------
  Normal   Scheduled  92s                default-scheduler  Successfully assigned pradeep-ns/order-microservice-rs-8tqrv to pool-h4jq5h014-ukl3l
  Normal   Pulled     43s (x2 over 91s)  kubelet            Container image "classpathio/order-microservice:latest" already present on machine
  Normal   Created    43s (x2 over 91s)  kubelet            Created container order-microservice
  Normal   Started    43s (x2 over 91s)  kubelet            Started container order-microservice
  Warning  Unhealthy  12s (x6 over 72s)  kubelet            Liveness probe failed: Get "http://10.244.0.206:8222/actuator/health/liveness": dial tcp 10.244.0.206:8222: connect: connection refused
  Normal   Killing    12s (x2 over 52s)  kubelet            Container order-microservice failed liveness probe, will be restarted
  Warning  Unhealthy  2s (x8 over 72s)   kubelet            Readiness probe failed: Get "http://10.244.0.206:8222/actuator/health/readiness": dial tcp 10.244.0.206:8222: connect: connection refused

pod定义如下

apiVersion: apps/v1
kind: ReplicaSet
metadata:
  name: order-microservice-rs
  labels:
    app: order-microservice
spec:
  replicas: 1
  selector:
    matchLabels:
      app: order-microservice
  template:
    metadata:
      name: order-microservice
      labels:
        app: order-microservice
    spec:
      containers:
        - name: order-microservice
          image: classpathio/order-microservice:latest
          imagePullPolicy: IfNotPresent
          env:
            - name: SPRING_PROFILES_ACTIVE
              value: dev
            - name: SPRING_DATASOURCE_USERNAME
              valueFrom:
                secretKeyRef:
                  key: username
                  name: db-credentials
            - name: SPRING_DATASOURCE_PASSWORD
              valueFrom:
                secretKeyRef:
                  key: password
                  name: db-credentials
          volumeMounts:
            - name: app-config
              mountPath: /app/config
            - name: app-logs
              mountPath: /var/log
          livenessProbe:
            httpGet:
              port: 8222
              path: /actuator/health/liveness
            initialDelaySeconds: 10
            periodSeconds: 10
          readinessProbe:
            httpGet:
              port: 8222
              path: /actuator/health/readiness
            initialDelaySeconds: 10
            periodSeconds: 10
          resources:
            requests:
              memory: "550Mi"
              cpu: "500m"
            limits:
              memory: "550Mi"
              cpu: "750m"
      volumes:
        - name: app-config
          configMap:
            name: order-microservice-config
        - name: app-logs
          emptyDir: {}
      restartPolicy: Always

如果我在replica-set清单中禁用liveness性和readiness性探测并exec到 pod，则在调用http://localhost:8222/actuator/health/liveness liveness 和http://localhost:8222/actuator/health/readiness时会收到有效响应http://localhost:8222/actuator/health/readiness端点。 为什么在使用liveness调用readiness和活跃度端点时，我的 pod 会重新启动并失败。 我哪里错了？

更新如果我删除了resource部分，则 pod 正在运行，但是当添加resource参数时， probes失败。

Answer 1

当您将容器 / spring 应用程序限制为 0.5 个内核（500 毫内核）时，启动可能需要比给定的活动探测阈值更长的时间。

您可以增加它们，或者使用具有更宽松设置的 startupProbe（fe failureThreshold 10）。 在这种情况下，您可以减少 liveness 探测的时间，并在检测到容器启动成功后获得更快的反馈。

Answer 2

您的 pod 配置仅提供 0.5 核 CPU，并且您的检查时间太短。 根据您的服务器 CPU 性能，spring 启动可能需要超过 10 秒的时间。 这是我的 spring 引导 pod 的配置可能会给你一个点。

"livenessProbe": {
              "httpGet": {
                "path": "/actuator/liveness",
                "port": 11032,
                "scheme": "HTTP"
              },
              "initialDelaySeconds": 90,
              "timeoutSeconds": 30,
              "periodSeconds": 30,
              "successThreshold": 1,
              "failureThreshold": 3
            },
            "readinessProbe": {
              "httpGet": {
                "path": "/actuator/health",
                "port": 11032,
                "scheme": "HTTP"
              },
              "initialDelaySeconds": 60,
              "timeoutSeconds": 30,
              "periodSeconds": 30,
              "successThreshold": 1,
              "failureThreshold": 3
            },

而且我没有限制CPU和memory资源，如果你限制CPU，会花费更多时间。 跳这可以帮助你。

Answer 3

当您尝试针对localhost的请求并且它可以工作时，它不能保证它可以在其他网络接口上工作。 Kubelet 是一个节点代理，因此请求将发送到您的eth0或等效项，而不是您的localhost 。

您可以通过从另一个 pod 向您的 pod 的 IP 地址或备份它的服务发出请求来检查它。

可能您正在使您的应用程序在localhost上提供服务，而您必须使其在0.0.0.0或eth0上提供服务。

k8s spring 启动 pod 准备就绪和活跃度探测失败

问题描述

3 个解决方案

解决方案1
1 2022-02-02 15:41:31

解决方案2
1 2022-02-02 16:05:42

解决方案3
0 2022-02-02 16:27:31

k8s spring 启动 pod 准备就绪和活跃度探测失败

问题描述

3 个解决方案

解决方案1 1 2022-02-02 15:41:31

解决方案2 1 2022-02-02 16:05:42

解决方案3 0 2022-02-02 16:27:31

解决方案1
1 2022-02-02 15:41:31

解决方案2
1 2022-02-02 16:05:42

解决方案3
0 2022-02-02 16:27:31