merge db_main to release #13

jnyi · 2025-03-15T02:06:42Z

A number of changes have been sitting in dev for a long time; the newest one is #12.

…fana#146) This addresses a bug in rollout-operator where:

1. Kubernetes receives a request to downscale a statefulset by `X` hosts.
2. The prepare-downscale admission webhook attempts to prepare `X` pods for shutdown by sending an HTTP `POST` to their handler identified by the `grafana.com/prepare-downscale-http-path` and `-port` annotations.
3. At least one of these requests fails. The admission webhook returns an error to Kubernetes, so the downscale is not approved.
4. 💥 But some hosts may have been prepared for downscale. 💥

This PR adds cleanup logic to issue `DELETE` requests on all involved pods if any of the `POST`s failed.

Notes:

* `DELETE` calls are attempted once.
* `DELETE` failures are logged but otherwise ignored.
* For simplicity, we'll invoke `DELETE` on all of the pods involved in the scaledown operation, not just ones that received a POST.

This doesn't fix the similar issue where replica count changing from 10->9->10 leaves that one pod prepared for shutdown. (But that's in the works.)

Fix unbalanced pairs in log, leading to a log message like this: `level=error ts=2024-06-13T03:30:49.769575693Z pod=ingester-zone-a-16 url=http://ingester-zone-a-16.ingester-zone-a.mimir-dev.svc.cluster.local./ingester/prepare-partition-downscale errorsendingHTTPPOSTrequesttoendpoint=err`

…s to store (grafana#151) Fix a snag found in grafana#146: if the "downscaled" annotation/configmap fails to persist, the scale operation is denied, but the pods are not informed via DELETE that they should no longer shut down.

* Only scale up zone after leader zone replicas are ready
* Update CHANGELOG
* Change to only scaling once all replicas are ready
* Rename config annotation
* Add log line
* remove redundant test
* Update changelog

* Added `grafana.com/rollout-mirror-replicas-from-resource-update-status-replicas` annotation to optionally disable patching of reference resource when using scaling based on reference resource.
* Review findings.
* CHANGELOG entry.

…-status-replicas` annotation (grafana#171)

* Renamed `grafana.com/rollout-mirror-replicas-from-resource-write-back-status-replicas` annotation to `grafana.com/rollout-mirror-replicas-from-resource-write-back`
* Fix changelog.
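As a rough illustration of how an opt-out annotation like this might be read, here is a minimal sketch. The annotation name comes from the commit message; the helper name `writeBackEnabled`, the default of enabled, and parsing with `strconv.ParseBool` are assumptions, not confirmed operator behavior.

```go
package main

import (
	"fmt"
	"strconv"
)

// Annotation name as given in the commit message for grafana#171.
const writeBackAnnotation = "grafana.com/rollout-mirror-replicas-from-resource-write-back"

// writeBackEnabled reports whether the reference resource should be patched.
// Assumption: write-back stays enabled unless the annotation parses as false;
// missing or unparseable values fall back to the default.
func writeBackEnabled(annotations map[string]string) bool {
	v, ok := annotations[writeBackAnnotation]
	if !ok {
		return true
	}
	enabled, err := strconv.ParseBool(v)
	if err != nil {
		return true
	}
	return enabled
}

func main() {
	fmt.Println(writeBackEnabled(nil))
	fmt.Println(writeBackEnabled(map[string]string{writeBackAnnotation: "false"}))
}
```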

yuchen-db

lgtm

johannaratliff and others added 30 commits April 4, 2024 12:54

CHANGELOG for license change (grafana#142)

ce7b413

Update dependencies (grafana#144)

afc5577

* Update dependencies
* Correct PR number

Cut v0.15.0 (grafana#145)

ea17193

Adjust changelog for v0.16.0 (grafana#147)

1a50079

Add a changelog entry for grafana#146, and prepare changelog for v0.16.0.

Co-authored-by: Patryk Prus <[email protected]>

Swap base image from alpine to distroless (grafana#149)

54b71c1

* Swap base image from alpine to distroless
* Remove user setup
* Use nonroot image
* Add different base image for boringcrypto
* Add changelog entry

Add request UID to webhook logs. (grafana#150)

3b0040a

For better debuggability when there are concurrent webhook calls.

Check for non-updated replicas during down-scale in zoneTracker (grafana#141)

0b18d84

Signed-off-by: JordanRushing <[email protected]>

Include username in 'handling request' log. (grafana#152)

b36d4bc

* Include UserInfo.Username in 'handling request' log.
* Changelog.

Support percentages in rollout-max-unavailable annotation (grafana#153)

fdfe18c

* Add support for specifying percentage in rollout-max-unavailable annotation.
* CHANGELOG.md
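To illustrate what accepting either a count or a percentage can look like, here is a minimal sketch. The function name `maxUnavailable` is hypothetical, and the rounding (down) and the floor of 1 are assumptions; the commit message does not state how the operator rounds.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// maxUnavailable resolves an annotation value such as "2" or "25%" into a pod
// count for a StatefulSet with the given number of replicas.
// Assumptions: percentages round down, and the result is never below 1 so a
// rollout can always make progress.
func maxUnavailable(value string, replicas int) (int, error) {
	value = strings.TrimSpace(value)
	if pct, ok := strings.CutSuffix(value, "%"); ok {
		p, err := strconv.Atoi(pct)
		if err != nil || p <= 0 || p > 100 {
			return 0, fmt.Errorf("invalid percentage %q", value)
		}
		n := replicas * p / 100
		if n < 1 {
			n = 1
		}
		return n, nil
	}
	n, err := strconv.Atoi(value)
	if err != nil || n <= 0 {
		return 0, fmt.Errorf("invalid count %q", value)
	}
	return n, nil
}

func main() {
	for _, v := range []string{"3", "25%", "10%", "abc"} {
		n, err := maxUnavailable(v, 10)
		fmt.Println(v, n, err)
	}
}
```

In Kubernetes code this kind of value is often handled with `intstr.GetScaledValueFromIntOrPercent` from `k8s.io/apimachinery`; the stdlib version above only keeps the sketch dependency-free.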

Allow delayed downscale of subset of pods (grafana#156)

5dce3cc

* When checking downscale delay in the statefulset allow downscale if some pods at the end of statefulset are ready to be downscaled.
* CHANGELOG.md

Update changelog for v0.17.0. (grafana#157)

f5bef38

Prep changelog for 0.17.1. (grafana#158)

53c59f6

Prepare 0.18 release (grafana#166)

b21cc68

Update release doc (grafana#167)

eaf0138

Update dependencies (grafana#165)

5be56c9

* Update dependencies
* Update CHANGELOG
* Fix build errors
* Upgrade docker and grpc for remaining CVEs

Update Go to 1.23 (grafana#168)

2608ac1

* Update Go to 1.23
* Add some nolint

Prepare v0.19.0. (grafana#170)

a37d1cf

Release v0.19.1. (grafana#172)

ef21e37

Update dependencies (grafana#174)

75e10d6

Cut v0.20.0 (grafana#175)

1cb25f6

Merge remote-tracking branch 'upstream/main' into merge-upstream-v0.20.0

7d10753

Merge pull request #7 from databricks/yuchen-db/merge-upstream-v0.20.0

3f1e1e2

Merge remote-tracking branch 'upstream/main' into merge-upstream-v0.20.0

fix: add support for delayed downscale port in the URL (grafana#176)

9d26ade

* fix: add support for delayed downscale port in the URL
* update changelog
* Update CHANGELOG.md

Co-authored-by: Marco Pracucci <[email protected]>

Cut v0.20.1 (grafana#177)

e74c10f

yuchen-db and others added 12 commits November 18, 2024 14:41

Merge remote-tracking branch 'upstream/main' into upstream

2e3cc5e

Merge pull request #9 from databricks/yuchen-db/upstream

1ce3edb

merge upstream

add boolean downscale logic

676cc83

add unit tests

a47baaf

Merge pull request #10 from databricks/yuchen-db/downscale

845beea

Add boolean downscale logic

add metrics for scale down debugging

e43c03e

fix typo

72333ad

add more metrics

138930b

Merge pull request #11 from databricks/yuchen-db/scale-down-metric

8dad033

add metrics for scale down debugging

[PLAT-129166] fix parallel db update

10571c5

Signed-off-by: Yi Jin <[email protected]>

test old behavior pass integration tests

71ce581

Signed-off-by: Yi Jin <[email protected]>

Merge pull request #12 from jnyi/PLAT-129166-parallel-db-update

2d46643

[PLAT-129166] fix parallel db update

jnyi requested review from hczhu-db and yuchen-db March 15, 2025 02:06

yuchen-db approved these changes Mar 17, 2025

jnyi merged commit 2b06b52 into release Mar 17, 2025
5 checks passed
