LivenessProbes not working #26398

sebidude · 2024-05-24T08:00:38Z

Name and Version

bitnami/etcd 10.1.0

What architecture are you using?

amd64

What steps will reproduce the bug?

Install the chart with TLS stuff enabled in the values.yaml

auth:
rbac:
  create: false
token:
  type: simple
client:
  secureTransport: true
  existingSecret: "etcd-client-certs"
  enableAuthentication: true
  caFilename: "ca.crt"
peer:
  secureTransport: true
  useAutoTLS: true
  caFilename: "ca.crt"

Pods restart after some time and the etcd cluster is in a really bad state

What is the expected behavior?

The cluster should be up an running stable

What do you see instead?

Pods restart after a short time due to failing http livenessProbes on PodIP:2379/health

Additional information

This seems to be related to #25984 where the livenessProbes changed.

kaykhan · 2024-05-24T09:33:07Z

I've also just encountered this. Both readiness probe and liveness probe are failing after installing nginx helm install my-nginx bitnami/nginx --version 17.2.1

I have a Kubernetes EKS cluster which is only IPV6. Curious if you have a similar setup, i wonder if its to do with the liveness and readiness probe are not configured correctly for ipv6

sebidude · 2024-05-24T12:03:25Z

I have a Kubernetes EKS cluster which is only IPV6. Curious if you have a similar setup, i wonder if its to do with the liveness and readiness probe are not configured correctly for ipv6

We run self-managed K8s Clusters on-prem and on cloud infrastructure. This was failing in a dev stage cluster. IPv4 only.
The only thing which was changed was the liveness probes. For now we just rolled back to 10.0.11

danielb43 · 2024-05-24T18:33:57Z

bitnami/etcd 10.1.1 is also affected by the original issue.

ismaildem · 2024-06-05T14:52:58Z

I think the problem here is that the incorrect port is being queried.
The endpoint livez is available via the metrics port and exclusively via http.
So if you have metrics.useSeparateEndpoint enabled, the liveness probe must use the port defined by .Values.containerPorts.metrics

BobVanB · 2024-06-06T06:13:44Z

I have the same problem when disabling rbac and only use client authentication with certificates.
The probe https://<>:2379/livez is not going through, because there is no client certificate passed.

There is a simple workaround until this is fixed inside the template:

customLivenessProbe:
  httpGet:
    port: 9090
    path: /livez
    scheme: HTTP
  initialDelaySeconds: 60
  periodSeconds: 30
  timeoutSeconds: 5
  successThreshold: 1
  failureThreshold: 5

metrics:
  useSeparateEndpoint: true

fmulero · 2024-06-14T16:46:57Z

Thanks a lot @BobVanB

To be perfectly blunt I don't see an easy solution if we kept the /livez endpoint and that change was intentional. Do you have any proposal in mind? Please feel free to open a PR.

BobVanB · 2024-06-14T20:51:38Z

Hi @fmulero

I'm not going to touch this topic any further. There is enough discussion about this.
For example: etcd-io/etcd#16007
It would be nice if we got this: etcdctl endpoint live

With kind regards,

github-actions · 2024-07-02T01:26:39Z

This Issue has been automatically marked as "stale" because it has not had recent activity (for 15 days). It will be closed if no further activity occurs. Thanks for the feedback.

github-actions · 2024-07-07T01:27:17Z

Due to the lack of activity in the last 5 days since it was marked as "stale", we proceed to close this Issue. Do not hesitate to reopen it later if necessary.

sebidude added the tech-issues The user has a technical issue about an application label May 24, 2024

github-actions bot added the triage Triage is needed label May 24, 2024

github-actions bot assigned carrodher May 24, 2024

carrodher added etcd in-progress labels May 24, 2024

github-actions bot removed the triage Triage is needed label May 24, 2024

github-actions bot assigned fmulero and unassigned carrodher May 24, 2024

github-actions bot added the stale 15 days without activity label Jul 2, 2024

github-actions bot added the solved label Jul 7, 2024

bitnami-bot closed this as not planned Won't fix, can't repro, duplicate, stale Jul 7, 2024

github-actions bot removed the in-progress label Jul 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LivenessProbes not working #26398

LivenessProbes not working #26398

sebidude commented May 24, 2024 •

edited by carrodher

Loading

kaykhan commented May 24, 2024 •

edited

Loading

sebidude commented May 24, 2024

danielb43 commented May 24, 2024

ismaildem commented Jun 5, 2024

BobVanB commented Jun 6, 2024 •

edited

Loading

fmulero commented Jun 14, 2024

BobVanB commented Jun 14, 2024 •

edited

Loading

github-actions bot commented Jul 2, 2024

github-actions bot commented Jul 7, 2024

LivenessProbes not working #26398

LivenessProbes not working #26398

Comments

sebidude commented May 24, 2024 • edited by carrodher Loading

Name and Version

What architecture are you using?

What steps will reproduce the bug?

What is the expected behavior?

What do you see instead?

Additional information

kaykhan commented May 24, 2024 • edited Loading

sebidude commented May 24, 2024

danielb43 commented May 24, 2024

ismaildem commented Jun 5, 2024

BobVanB commented Jun 6, 2024 • edited Loading

fmulero commented Jun 14, 2024

BobVanB commented Jun 14, 2024 • edited Loading

github-actions bot commented Jul 2, 2024

github-actions bot commented Jul 7, 2024

sebidude commented May 24, 2024 •

edited by carrodher

Loading

kaykhan commented May 24, 2024 •

edited

Loading

BobVanB commented Jun 6, 2024 •

edited

Loading

BobVanB commented Jun 14, 2024 •

edited

Loading