Remove helm value for gpuResourcesEnabledOverride #463

Draft
klueska wants to merge 6 commits into NVIDIA:main from klueska:remove-gpuResourcesEnabledOverride

Conversation

klueska (Collaborator) commented Aug 13, 2025

With this removed, the GPU side of the driver will be enabled by default, with no safeguard to ensure that someone explicitly sets it to false if that is their actual intention.
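
For readers unfamiliar with the safeguard under discussion, here is a minimal sketch of the kind of check the override value gates, modeled in Python rather than as a Helm template. It is an illustration under assumptions, not the chart's actual validation: `ChartValues`, `gpu_resources_enabled`, `validate`, and the error text are all hypothetical; only the `gpuResourcesEnabledOverride` name comes from this PR.

```python
from dataclasses import dataclass


@dataclass
class ChartValues:
    # Hypothetical stand-ins for Helm values. Only the name
    # gpuResourcesEnabledOverride appears in this PR; the other field
    # and both defaults are assumptions made for illustration.
    gpu_resources_enabled: bool = True
    gpu_resources_enabled_override: bool = False


def validate(values: ChartValues) -> None:
    """Reject a configuration that enables GPU resources without the override."""
    if values.gpu_resources_enabled and not values.gpu_resources_enabled_override:
        # Illustrative wording only, not the chart's real error message.
        raise ValueError(
            "GPU resources are enabled; set gpuResourcesEnabledOverride=true "
            "to confirm, or disable GPU resources explicitly."
        )
```

In this sketch, the default values fail validation, so a user has to make an explicit choice either way. Removing the override value would amount to dropping the check and letting the default through.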

copy-pr-bot (bot) commented Aug 13, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

klueska modified the milestones: v25.3.0, unscheduled Aug 13, 2025
klueska added the config label (issue/PR about user-facing configuration interface) Aug 13, 2025
klueska force-pushed the remove-gpuResourcesEnabledOverride branch from aa5a432 to 155186d August 20, 2025 18:48
This is a straight copy of this PR
kubernetes-sigs/dra-example-driver#75, with
minimal changes to make it work in this repo.

Signed-off-by: Kevin Klues <kklues@nvidia.com>
Signed-off-by: Kevin Klues <kklues@nvidia.com>
Signed-off-by: Kevin Klues <kklues@nvidia.com>
Signed-off-by: Kevin Klues <kklues@nvidia.com>
Previously, the webhook only supported cert-manager with a self-signed
issuer for TLS certificate generation and management. With this patch,
users can now provide their own TLS certificates via secret mode or use
custom cert-manager issuers (issuer/clusterissuer) with specified issuer
names.

Signed-off-by: Kevin Klues <kklues@nvidia.com>
With this removed, the GPU side of the driver will be enabled by default
with no safeguards to ensure that someone explicitly sets it to false if
that is their actual intention.

Signed-off-by: Kevin Klues <kklues@nvidia.com>
klueska force-pushed the remove-gpuResourcesEnabledOverride branch from 155186d to 8df56a8 August 20, 2025 21:14
klueska modified the milestones: unscheduled, v25.12.0 Nov 24, 2025
klueska self-assigned this Nov 24, 2025

jgehrcke (Collaborator) commented Dec 9, 2025

@klueska should we revive this? Do we only need 8df56a8 from your PR branch here?

klueska modified the milestones: v25.12.0, Backlog Jan 21, 2026

jgehrcke (Collaborator) commented Feb 2, 2026

We still need a patch like this for the 25.12.0 release -- correct?

(With the 25.12.0 release, we don't want people to have to set --set gpuResourcesEnabledOverride=true anymore -- correct?)

CC @shivamerla

klueska (Collaborator, Author) commented Feb 2, 2026

No! We still don't want this on by default until the DRA extended resources API is GA in upstream Kubernetes. The change relevant to this release was to change the error message. It was merged here: #809

jgehrcke (Collaborator) commented Feb 2, 2026

Got it. I missed #809 -- the new error message is clear.

klueska removed this from the Backlog milestone Feb 8, 2026

Labels

config: issue/PR about user-facing configuration interface

Projects

Status: Backlog

2 participants