chore: upgrade Kubernetes deps and local images to 1.35 (incl. controller-runtime v0.23 and KAI-scheduler v0.15)#603
Conversation
CI status: blocked on KAI-Scheduler upstreamAll 7 failing E2E jobs share the same root cause, and it is not the kind/k3d 1.35 image bump itself. The k3d cluster (
KAI-Scheduler sideThe fix is already merged on
But no released tag contains it yet — the latest tags Proposed pathTwo options, in order of preference:
Marking the PR as pending upstream while we decide. Suggestions welcome. |
Thanks for raising this PR ! |
renormalize
left a comment
There was a problem hiding this comment.
https://github.com/kai-scheduler/KAI-Scheduler/releases/tag/v0.15.0 is out. Can the PR be rebased with the kai-scheduler bump? Thanks.
b9bdefd to
b72fd55
Compare
HI @renormalize Done. Rebased on main and bumped KAI-scheduler v0.14.0 → v0.15.0 |
b72fd55 to
8b64e6a
Compare
9d9e1af to
59c8a49
Compare
|
@yankay I took the liberty to sync the dependencies across the multiple Go modules we have in this repository, as they were not synced up after your rebased commit. We can merge once the E2E tests pass. |
Thanks @renormalize for the cross-module dep sync :-) |
|
@yankay Do you want to also update the installed KAI version in the docs / e2e scripts to v0.15.0 ? |
59c8a49 to
3e72a55
Compare
|
Thanks @enoodle! Good catch — bumped the installed KAI version to v0.15.0 in docs and e2e tooling PTAL 🙏 Waiting for the fix to be released in KAI v0.5.1 or v0.6.0, then we can re-run and confirm this PR. |
06da07e to
6ca589f
Compare
|
@danbar2 any idea on when KAI plans to release v0.15.1 or v0.16.0? |
|
@renormalize I will work on including this fix in v0.15.1 fix of KAI. |
|
@renormalize @yankay KAI v0.15.2 contains the fix you referenced. Can you update to use that? |
6ca589f to
cd75d03
Compare
cd75d03 to
24ffe87
Compare
Update Grove's Kubernetes dependency baseline from 1.34 to 1.35. - Bump k8s.io/* to v0.35.5 and controller-runtime to v0.23.3. - Bump KAI Scheduler to v0.15.2 for controller-runtime v0.23 compatibility. - Bump local kind and k3s images to the 1.35 line. - Raise the operator module Go directive to 1.26.3 for the updated KAI Scheduler dependency. - Refresh generated clients, CRDs, docs, and e2e install pins. Co-authored-by: Saketh Kalaga <saketh.kalaga@sap.com> Signed-off-by: Kay Yan <kay.yan@daocloud.io>
f8b9ea1 to
c754169
Compare
What type of PR is this?
/kind cleanup
What this PR does / why we need it:
Upgrades Grove's Kubernetes dependency baseline from 1.34 to 1.35.
k8s.io/*:v0.34.3->v0.35.5sigs.k8s.io/controller-runtime:v0.22.4->v0.23.3github.com/kai-scheduler/KAI-scheduler:v0.14.0->v0.15.2(required for controller-runtime v0.23)kindest/node:v1.34.3->v1.35.1rancher/k3s:v1.34.2-k3s1->v1.35.5-k3s1The installed KAI-scheduler version in docs and e2e tooling (
docs/installation.md,operator/e2e/README.md,operator/hack/{e2e-cluster,infra_manager}/dependencies.yaml,operator/hack/e2e-cluster/create-e2e-cluster.py) is also bumped tov0.15.2.KAI v0.15 changes
PodGroupSpec.MinMember/SubGroup.MinMemberfromint32to*int32; only the e2e SubGroup verifier needed a deref. Generated code refreshed viamake generateandmake generate-api-docs.Two incidental changes that came along with the upgrade:
operator/go.mod: Go toolchain bumped1.26.1->1.26.3, and the now-unusedoperator/clientrequire plus thek8s.io/kubeletreplace directive were dropped bygo mod tidy.operator/e2e/setup/k8s_clusters.go: raised client-side rate limits (QPS 50 / Burst 100) above the client-go defaults (5/10); e2e polling loops were otherwise hittingclient rate limiter Wait ... context deadline exceededunder the rolling/ondelete update tests on 1.35.Which issue(s) this PR fixes:
Fixes #602
Special notes for your reviewer:
Local:
go build ./...,go build -tags e2e ./e2e/...,go test ./...,go vet ./...all pass.Related design PR: #605
Does this PR introduce a API change?
Additional documentation e.g., enhancement proposals, usage docs, etc.: