[WIP] Add e2e test for KubeRay NativeWorkloadScheduling by mboersma · Pull Request #6227 · kubernetes-sigs/cluster-api-provider-azure

mboersma · 2026-04-10T20:24:48Z

What type of PR is this?

/kind feature

What this PR does / why we need it:

Add a new e2e test that exercises the unreleased NativeWorkloadScheduling feature from the kuberay workload-poc branch. This feature uses the Kubernetes-native scheduling.k8s.io/v1alpha2 API (Workload + PodGroup) for gang scheduling of Ray pods.

Changes:

New cluster template ci-version-native-scheduling with K8s feature gates GenericWorkload, GangScheduling, and runtime config for scheduling.k8s.io/v1alpha2
InstallHelmChartFromPath and InstallKubeRayOperatorFromSource helpers for installing kuberay from a local chart with custom image
KubeRayNativeSchedulingSpec test that creates a RayCluster with the opt-in annotation, verifies Workload and PodGroup resources are created, and confirms all pods reach Running state
New Ginkgo test case tagged [KubeRay] [NativeScheduling]

Which issue(s) this PR fixes:
Fixes #

Special notes for your reviewer:

TODOs:

squashed commits
includes documentation
adds unit tests
cherry-pick candidate

Release note:

NONE

k8s-ci-robot · 2026-04-10T20:24:51Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

k8s-ci-robot · 2026-04-10T20:24:57Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign jont828 for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

codecov · 2026-04-10T20:27:40Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 43.85%. Comparing base (a248a9c) to head (a94382b).
⚠️ Report is 10 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #6227   +/-   ##
=======================================
  Coverage   43.85%   43.85%           
=======================================
  Files         289      289           
  Lines       25341    25341           
=======================================
  Hits        11113    11113           
  Misses      13450    13450           
  Partials      778      778

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

… 1.36+ Add a new e2e test that exercises the unreleased NativeWorkloadScheduling feature from the kuberay workload-poc branch. This feature uses the Kubernetes-native scheduling.k8s.io/v1alpha2 API (Workload + PodGroup) for gang scheduling of Ray pods. Changes: - New cluster template ci-version-native-scheduling with K8s feature gates GenericWorkload, GangScheduling, and runtime config for scheduling.k8s.io/v1alpha2 - InstallHelmChartFromPath and InstallKubeRayOperatorFromSource helpers for installing kuberay from a local chart with custom image - KubeRayNativeSchedulingSpec test that creates a RayCluster with the opt-in annotation, verifies Workload and PodGroup resources are created, and confirms all pods reach Running state - New Ginkgo test case tagged [KubeRay] [NativeScheduling]

…dential provider dependency - Add scripts/ci-build-kuberay-operator.sh to clone marosset/kuberay@workload-poc, build the operator image, and push it to the local registry - Source the build script from ci-e2e.sh when GINKGO_FOCUS matches NativeScheduling - Remove ACR credential provider scripts and kubelet args from the ci-version-native-scheduling template (not needed without custom CCM) - Remove cloud-provider-azure-chart-ci HelmChartProxy (use released CCM) - Remove CLOUD_PROVIDER_AZURE_LABEL=azure-ci override from the test - Add _kuberay-source/ to .gitignore

The ci-version-native-scheduling template requires the azure-ci CCM chart variant with explicit image tags, same as other ci-version flavors. Without it the released cloud-provider-azure chart fails to install because it cannot auto-detect images for unreleased K8s versions. - Restore template to full ci-version parity (ACR credential provider, cloud-provider-azure-chart-ci HelmChartProxy) - Restore CLOUD_PROVIDER_AZURE_LABEL=azure-ci in the test - Trigger CCM build in ci-e2e.sh when GINKGO_FOCUS matches NativeScheduling

mboersma · 2026-04-10T20:44:09Z