[ECOINT-212] Redpanda v2.1.0 - Support for lag metrics #2719
base: master
Conversation
The tests now pass. Signed-off-by: Ben Pope <[email protected]>
As of Redpanda v25.1, some consumer group metrics can be disabled by changing cluster config "enable_consumer_group_metrics". These are still enabled by default, but split them out into their own grouping. Signed-off-by: Ben Pope <[email protected]>
…deprecating the older offset metrics
Signed-off-by: Paul Wilkinson <[email protected]>
Signed-off-by: Ben Pope <[email protected]>
My recommendation is to add these 4 widgets to the current dashboard (using a separate grouping widget). Once merged, these changes will be reflected in the original dashboards for existing customer installations.
If you'd like to keep this second dashboard, that's fine too.
In either case, the dashboard(s) will need to be brought in line with our best practices. See attached dashboard template.
Integration_Dashboard_Template.json
Where can we find documentation on the dashboard best practices? Is this json template the only description of this?
@hcoyote, I've found the best practice guides and have started reorganising the dashboards accordingly.
@dd-dominic, thanks for the recommendation, but I think we'd prefer to not have them in the main dashboard:
Firstly, there's a somewhat different audience for the lag metrics (Redpanda users interested in how their applications are behaving) vs the rest of the metrics (Redpanda administrators interested in how the cluster as a whole is behaving).
Secondly, the lag metrics are optional on the Redpanda side, and are only emitted once Redpanda is configured to do so, so having widgets in the default dashboard that show no data feels off.
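To illustrate the second point, a quick check like the following could tell whether a broker is emitting any lag metrics at all, by scanning Prometheus-style text exposition output. This is only a sketch: the sample payload and the metric names in it are hypothetical placeholders, not confirmed Redpanda metric names, and the "name contains lag" rule is an assumption made for the example.

```python
def lag_metric_names(exposition_text):
    """Return the set of metric names that contain 'lag'."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        # The metric name ends at the first '{' (labels) or space (value).
        name = line.split("{", 1)[0].split(" ", 1)[0]
        if "lag" in name:
            names.add(name)
    return names


# Hypothetical sample payload; real metric names will differ.
SAMPLE = """\
# HELP example_uptime_seconds Hypothetical uptime gauge
# TYPE example_uptime_seconds gauge
example_uptime_seconds 12
# TYPE example_consumer_group_lag_max gauge
example_consumer_group_lag_max{group="orders"} 5
"""

print(sorted(lag_metric_names(SAMPLE)))
```

An empty result would mean the lag widgets have nothing to plot, which is the situation a separate dashboard avoids.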
Understood, makes sense. Just make sure to update both dashboards to mirror our template as much as possible (colored group headings, branding/image, title casing, etc.)
Done. It would be lovely to be able to specify our own branding colours via a hex code though 👍
Looks great! I just had one comment about testing
…to account for the change.
What does this PR do?
This adds support to the Redpanda integration for native lag metrics.
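For context, consumer lag for a partition is conventionally the gap between the latest offset in the log and the offset the group has committed. A native lag metric reports that figure directly, rather than leaving it to be derived from separate offset metrics. A minimal sketch of the derivation follows; the function names and the dictionary layout are hypothetical and are not taken from the integration's code.

```python
def partition_lag(latest_offset, committed_offset):
    """Lag for one partition, clamped at zero."""
    return max(latest_offset - committed_offset, 0)


def group_lag(latest, committed):
    """Total lag across the partitions a group has committed offsets for.

    Both arguments are dicts keyed by (topic, partition). Partitions with
    no committed offset are skipped, which is a simplification made for
    this example.
    """
    return sum(
        partition_lag(latest[tp], offset)
        for tp, offset in committed.items()
        if tp in latest
    )


# Hypothetical offsets for a two-partition topic.
latest = {("orders", 0): 10, ("orders", 1): 7}
committed = {("orders", 0): 4, ("orders", 1): 7}
print(group_lag(latest, committed))
```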
Motivation
New feature availability within Redpanda.
Review checklist
no-changelog label attached
Additional Notes
Anything else we should know when reviewing?