Fix bad input and deployment container crash error in notebook tests #3609

javed-73 · 2025-06-03T13:01:58Z

Description

Couple of issues fixed in the PR

Bad input error.
Image-object-detection notebook
test run is failing because of bad input.

image_object_detection_job = automl.image_object_detection(
compute=compute_name,
experiment_name=exp_name,
training_data=my_training_data_input,
validation_data=my_validation_data_input,
target_column_name="label",
primary_metric="mean_average_precision",
tags={"my_custom_tag": "My custom value"},
)

image_object_detection_job.set_limits(
max_trials=2,
max_concurrent_trials=2,
)

Issue: max trials used for running the sweep pipeline job uses max trials as 2, which is throwing "Error: Input request is invalid" as can be seen in the job

Updating to max trials to 3 fixes it.
Job

Error: Container crash at deployment step
Failed build
Looking at the deployment logs, below is the error.
RuntimeError: Attempting to deserialize object on a CUDA device but torch.cuda.is_available() is False. If you are running on a CPU-only machine, please use torch.load with map_location=torch.device('cpu') to map your storages to the CPU.
Job
Fix: use GPU machine for deployment.

Checklist

I have read the contribution guidelines.
- General
- SDK
- CLI
I have coordinated with the docs team ([email protected]) if this PR deletes files or changes any file names or file extensions.
Pull request includes test coverage for the included changes.
This notebook or file is added to the CODEOWNERS file, pointing to the author or the author's team.

javed-73 · 2025-06-04T05:23:57Z

The image-object-detection test is failing later at deployment step which will be taken care in a separate PR. But as far as bad input failures is concerned, this change has worked.

yeshsurya

please merge post gates are successfull

SamGos93 · 2025-06-05T06:58:03Z

...image-object-detection-task-fridge-items-automl-image-object-detection-task-fridge-items.yml

@@ -85,6 +85,7 @@ jobs:
          source "${{ github.workspace }}/infra/bootstrapping/init_environment.sh";
          bash "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh" generate_workspace_config "../../.azureml/config.json";
          bash "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh" replace_template_values "automl-image-object-detection-task-fridge-items.ipynb";
+          sed -i 's/max_trials=2/max_trials=3/g' automl-image-object-detection-task-fridge-items.ipynb


Why is this not working with 2 but with 3 max trials?

max_trials as per documentation mentions the following:
Parameter for maximum number of configurations to sweep. Must be an integer between 1 and 1000. When exploring just the default hyperparameters for a given model algorithm, set this parameter to 1. Default value is 1.

Trying to understand why?

fix bad input error in notebook test

67d47cc

SamGos93 previously approved these changes Jun 4, 2025

View reviewed changes

yeshsurya approved these changes Jun 4, 2025

View reviewed changes

yeshsurya requested changes Jun 4, 2025

View reviewed changes

use GPU machine for deployment

1f1ebbc

javed-73 dismissed SamGos93’s stale review via 1f1ebbc June 4, 2025 11:43

use GPU machine for instance segmentation task deployment

7148cec

javed-73 changed the title ~~fix bad input error in notebook test~~ Fix bad input and deployment container crash error in notebook tests Jun 5, 2025

yeshsurya self-requested a review June 5, 2025 05:57

yeshsurya approved these changes Jun 5, 2025

View reviewed changes

SamGos93 reviewed Jun 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix bad input and deployment container crash error in notebook tests #3609

Fix bad input and deployment container crash error in notebook tests #3609

Uh oh!

javed-73 commented Jun 3, 2025 •

edited

Loading

Uh oh!

javed-73 commented Jun 4, 2025

Uh oh!

yeshsurya left a comment

Uh oh!

SamGos93 Jun 5, 2025

Uh oh!

Uh oh!

Fix bad input and deployment container crash error in notebook tests #3609

Are you sure you want to change the base?

Fix bad input and deployment container crash error in notebook tests #3609

Uh oh!

Conversation

javed-73 commented Jun 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

javed-73 commented Jun 4, 2025

Uh oh!

yeshsurya left a comment

Choose a reason for hiding this comment

Uh oh!

SamGos93 Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

javed-73 commented Jun 3, 2025 •

edited

Loading