schedulers/kubernetes_scheduler: add support for resource instance-type node selectors #433
This allows specifying instance types when scheduling Kubernetes jobs. It uses `node_selectors` and the `node.kubernetes.io/instance-type` label on nodes to limit pods to specific instance types.

Nodes reserve some CPU and memory for themselves, so a pod that requests an instance type's full CPU and memory may not fit on a node of that type. To avoid this, a small amount of CPU and memory is subtracted from the requested resources. Limits remain the same.
Also adds the g4dn.xlarge resource type.
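
A minimal sketch of the idea, not the code in this PR: the helper name `build_pod_scheduling`, the reserve constants and their values, and the returned dict layout are all hypothetical and chosen only for illustration. It pins a pod to an instance type through the `node.kubernetes.io/instance-type` label and lowers the requests while leaving the limits at the full size.

```python
from typing import Dict, Optional

# Hypothetical reserve amounts. The PR only says "a small amount";
# the actual values used by the scheduler may differ.
RESERVED_MILLICPU = 100
RESERVED_MEM_MB = 1024

# Well-known Kubernetes node label carrying the instance type.
INSTANCE_TYPE_LABEL = "node.kubernetes.io/instance-type"


def build_pod_scheduling(
    cpu: int, mem_mb: int, instance_type: Optional[str] = None
) -> Dict[str, Dict[str, object]]:
    """Return resource limits/requests and a node selector as plain dicts."""
    # Limits keep the full size of the resource.
    limits = {"cpu": f"{cpu * 1000}m", "memory": f"{mem_mb}M"}

    # Requests are reduced so the pod still fits next to the node's own
    # reserved CPU and memory. Clamp at zero for very small resources.
    requests = {
        "cpu": f"{max(cpu * 1000 - RESERVED_MILLICPU, 0)}m",
        "memory": f"{max(mem_mb - RESERVED_MEM_MB, 0)}M",
    }

    # Only constrain placement when an instance type was given.
    node_selector: Dict[str, object] = {}
    if instance_type is not None:
        node_selector[INSTANCE_TYPE_LABEL] = instance_type

    return {
        "resources": {"limits": limits, "requests": requests},
        "node_selector": node_selector,
    }


# Illustrative sizes for a 4-CPU, 16384 MB resource pinned to g4dn.xlarge.
spec = build_pod_scheduling(cpu=4, mem_mb=16384, instance_type="g4dn.xlarge")
print(spec["resources"]["requests"])
print(spec["node_selector"])
```

Keeping the limits unchanged means the container can still use the whole instance, while the smaller request is what the Kubernetes scheduler compares against the node's allocatable capacity.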
Test plan:
Unit tests; updated the Kubernetes dist integration test to specify an instance type.
https://github.com/pytorch/torchx/runs/5698855673?check_suite_focus=true