PROPOSAL: Reduce init container memory from 500Mi to 64Mi #1863

mkuratczyk · 2025-04-29T13:35:19Z

It'd be great to get some feedback from users who changed the default init container resources - what did you change them to? I've tested successfully deployed RabbitMQ with a 50Mi init container using:

rabbitmq:4.1-management
bitnami/rabbitmq:4.1
rabbitmq.packages.broadcom.com/vmware-tanzu-rabbitmq:4.1.0

Given that an init container failure would be a terrible user experience, I'd rather have some headroom, hence 64Mi.

When originally developed, it wasn't clear what would need to be done in the init container. With hindsight, there's very little to do and the init phase hasn't changed in a long time.

We also assumed that the resources requested for the init container would effectively be reused by the actual rabbitmq container later. Turns out that's not the case: kubernetes/kubernetes#124282 init container resources are considered even after the init container completes its task.

This closes #960

When originally developed, it wasn't clear what would need to be done in the init container. With hindsight, there's very little to do and the init phase hasn't changed in a long time. We also assumed that the resources requested for the init container would effectively be reused by the actual `rabbitmq` container later. Turns out that's not the case: kubernetes/kubernetes#124282 init container resources are considered even after the init container completes its task.

zvlb · 2025-05-02T13:18:09Z

We tested with this resources:

resources:
  limits:
    cpu: 10m
    memory: 50Mi
  requests:
    cpu: 10m
    memory: 50Mi

All working good.

I think we can install:

initContainerCPU    string = "20m"
initContainerMemory string = "64Mi"

without any potential issues for users

Zerpet · 2025-05-05T10:38:23Z

I like this proposal 👍 The init container doesn't do anything special, and it uses very basic shell commands like cp and mv.

zvlb · 2025-05-05T12:00:03Z

pls add changes to initContainerCPU before merge it)

mkuratczyk · 2025-05-05T13:06:05Z

@Zerpet I've also pushed a change in CPU from 100m to 20m. Still ok with this?

Zerpet · 2025-05-05T15:49:06Z

20m is 0.02 CPU 🤔 I understand the intention of lowering this value so drastically, after reading again kubernetes/kubernetes#124282 and revisiting Pod QoS documentation. I'm not opposed to this change, since there are claims that 20m CPU JustWorks™

mkuratczyk · 2025-05-05T15:58:22Z

Yeah, I even tried what happens if I try to use 20m to start RabbitMQ itself. I didn't wait long enough to see it running but nothing was crashing, it was just starting very very slowly... :)
Should not be a problem at all to just move some files around.

zvlb · 2025-05-05T17:11:17Z

@mkuratczyk @Zerpet Thank you!

Сan I know when to expect a release with these changes?
The last release was on January 24th and I'm a little scared to guess how long to wait for the next one. Maybe there is some kind of release plan?

mkuratczyk · 2025-05-05T18:06:16Z

There's just not that much to release usually. The Operator as it is, seems to be working well for a lot of people.
We can discuss within the team tomorrow and let you know.

One thing you I'd love to hear your input for is whether it's a problem that this change, as currently implemented, would restart all those clusters you have, since changes to the StatefulSet definition cause a restart. We could make some tweaks to prevent this I think

zvlb · 2025-05-05T18:26:20Z

I understand that restarting all RabbitMQ clusters is an unpleasant situation and could indeed cause dissatisfaction among users. However, in my case, I don’t consider this a problem. If someone deployed a RabbitMQ cluster with a single instance and expects 100% SLA—that’s the problem of whoever set it up that way. Besides the reboot, such RabbitMQ installations cannot be considered stable, as a pod can be terminated for a multitude of reasons, and changes to the StatefulSet are just one of them!

If the RabbitMQ cluster is deployed with more than one node, updating the StatefulSet should proceed without performance issues.

mkuratczyk · 2025-05-06T17:11:36Z

Version 2.13.0 is now available:
https://github.com/rabbitmq/cluster-operator/releases/tag/v2.13.0

mkuratczyk mentioned this pull request Apr 29, 2025

initContainerMemory and cpu should can be customize #960

Closed

Zerpet approved these changes May 5, 2025

View reviewed changes

Reduce CPU resources from 100m to 20m

b097eac

mkuratczyk merged commit 9cd65f0 into main May 5, 2025
13 checks passed

mkuratczyk deleted the lower-init-container-memory branch May 5, 2025 15:58

Zerpet added this to the 2.12.2 milestone May 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

PROPOSAL: Reduce init container memory from 500Mi to 64Mi #1863

PROPOSAL: Reduce init container memory from 500Mi to 64Mi #1863

Uh oh!

mkuratczyk commented Apr 29, 2025

Uh oh!

zvlb commented May 2, 2025

Uh oh!

Zerpet commented May 5, 2025

Uh oh!

zvlb commented May 5, 2025

Uh oh!

mkuratczyk commented May 5, 2025

Uh oh!

Zerpet commented May 5, 2025

Uh oh!

mkuratczyk commented May 5, 2025

Uh oh!

Uh oh!

zvlb commented May 5, 2025

Uh oh!

mkuratczyk commented May 5, 2025

Uh oh!

zvlb commented May 5, 2025

Uh oh!

mkuratczyk commented May 6, 2025

Uh oh!

Uh oh!

PROPOSAL: Reduce init container memory from 500Mi to 64Mi #1863

PROPOSAL: Reduce init container memory from 500Mi to 64Mi #1863

Uh oh!

Conversation

mkuratczyk commented Apr 29, 2025

Uh oh!

zvlb commented May 2, 2025

Uh oh!

Zerpet commented May 5, 2025

Uh oh!

zvlb commented May 5, 2025

Uh oh!

mkuratczyk commented May 5, 2025

Uh oh!

Zerpet commented May 5, 2025

Uh oh!

mkuratczyk commented May 5, 2025

Uh oh!

Uh oh!

zvlb commented May 5, 2025

Uh oh!

mkuratczyk commented May 5, 2025

Uh oh!

zvlb commented May 5, 2025

Uh oh!

mkuratczyk commented May 6, 2025

Uh oh!

Uh oh!