Hi all,
We use Linkerd for gRPC load balancing on AWS EKS. After upgrading to Linkerd stable-2.13.0, our k8s services intermittently fail with gRPC "14 UNAVAILABLE: connections to all backends failing" errors.
We found that the cause is a memory leak in the system.mem.slab memory of the AWS EKS nodes, and it seems to happen only on AWS nodes with aarch64 (arm64) CPUs.
We confirmed that the issue first appears in Edge-23.2.3, and we suspect it was caused by bumping the slab dependency to version 0.4.8 (PR).
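For anyone who wants to check their own nodes, here is a minimal sketch of how slab usage can be read directly on a Linux node. It assumes that system.mem.slab corresponds to the Slab line in /proc/meminfo, which we have not verified against the monitoring agent; the function names parse_meminfo and slab_summary are made up for this example.

```python
from pathlib import Path


def parse_meminfo(text):
    """Parse /proc/meminfo-style text into {field name: integer value}.

    Most fields are reported in kB; a few (e.g. HugePages_*) are plain counts.
    """
    fields = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        if not sep:
            continue  # skip lines that are not "Name: value" pairs
        parts = rest.split()
        if parts and parts[0].isdigit():
            fields[name.strip()] = int(parts[0])
    return fields


def slab_summary(fields):
    """Pick out the slab-related fields; a missing field maps to None."""
    return {key: fields.get(key) for key in ("Slab", "SReclaimable", "SUnreclaim")}


if __name__ == "__main__":
    meminfo = Path("/proc/meminfo")
    if meminfo.exists():  # only present on Linux
        print(slab_summary(parse_meminfo(meminfo.read_text())))
```

Running this periodically on an affected node and on an unaffected one should show whether Slab (and SUnreclaim in particular) keeps growing over time.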
- Linkerd version: Edge-23.2.3
- Platform:
  - Kernel Name: Linux
  - Kernel Release: 5.10.167-147.601.amzn2.aarch64
  - Kernel Version: #1 SMP Tue Feb 14 21:50:23 UTC 2023
  - Processor: aarch64
Could you please help us look into this? I sincerely appreciate your help.