[Slurm scheduler] Add better support for specifying resources in slurm #359
Labels
bug
Something isn't working
module: runner
issues related to the torchx.runner and torchx.scheduler modules
slurm
slurm scheduler
🐛 Bug
According to aws/aws-parallelcluster#2198 PCluster has problems running jobs that have explicit memory requirements.
We need to modify our slurm scheduler to address this.
Module (check all that applies):
torchx.spec
torchx.component
torchx.apps
torchx.runtime
torchx.cli
torchx.schedulers
torchx.pipelines
torchx.aws
torchx.examples
other
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Job successfully executed
The text was updated successfully, but these errors were encountered: