Issues with prometheus getting evicted due to memory

mmclane · October 7, 2025, 2:17pm

This morning I have noticed we have been having some issues causing pod restarts. After digging into it I think it is related to linkerd pods getting evicted because the prometheus pod in that namespace is running very high. It has gotten into a restart loop where I had to delete the WAL files to get the pod to start. Its currently running with memory between 3.5 and 4.5 GB which is a LOT higher then any other prometheus install on my cluster. It is easily the single biggest consumer of memory of any pod by a factor of 4.

What can I do to reduce the amount of memory that this pod consumes?

Flynn · October 7, 2025, 2:20pm

For context, is this a Prometheus from linkerd-viz or one you manage?

mmclane · October 7, 2025, 2:31pm

This specifically is from linkerd-viz. We have others that we manage and we don’t see this issue with them.

Flynn · October 7, 2025, 2:34pm

Yeah, the linkerd-viz Prometheus holds all the trace data in memory; it is definitely not appropriate for production usage. Check out Bringing your own Prometheus | Linkerd for the guide on replacing it with a Prometheus that you manage (which, of course, could be one you already have running).

mmclane · October 7, 2025, 2:36pm

Ah.. and we did just scale up our infra yesterday. That kinda makes sense.

Topic		Replies	Views
Potential (slow) memory leak for linkerd-proxy sidecar for single workload on 2.13.3 Linkerd General Discussion	8	463	July 3, 2023
Linkerd destination control plane pod restarts Linkerd General Discussion configuration	1	574	September 27, 2023
Cannot see HTTP stats in viz dashboard Linkerd General Discussion	13	1886	December 7, 2023
Linkerd Proxy memory usage increase & OOM when app response with ~5MB payload over ~12 requests/sec Linkerd General Discussion proxy	1	1435	July 24, 2023
OOM killed linkerd-proxy Linkerd General Discussion	1	441	February 13, 2024

Issues with prometheus getting evicted due to memory

Related topics