Update Axolotl Examples #2502
Conversation
Updated Axolotl NVIDIA example with Llama 4 Scout
Updated AMD Axolotl example to fix a dependency error
# Using RunPod's ROCm Docker image
image: runpod/pytorch:2.1.2-py3.10-rocm6.0.2-ubuntu22.04
# Required environment variables
env:
  - HF_TOKEN
  - WANDB_API_KEY
  - WANDB_PROJECT
How is WANDB_API_KEY not enough?
No, we need to set WANDB_PROJECT and WANDB_NAME. The difference is that in our current master they are set in the config file, while in this PR we pass them as arguments. When we pass them as arguments, we don't need to include the config.yaml in our repo.
Okay, then let's at least hardcode the value of WANDB_NAME, e.g. to axolotl-amd-llama31-train. If the user wants, they can change it.
BTW, this is another use case where we could set it to $DSTACK_RUN_NAME.
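For illustration, a minimal sketch of how that could look in the task config; it assumes dstack exports DSTACK_RUN_NAME inside the container, and the env keys mirror the diff above:

env:
  - HF_TOKEN
  - WANDB_API_KEY
  - WANDB_PROJECT
commands:
  # Assumption: dstack sets DSTACK_RUN_NAME in the container environment,
  # so the W&B run name follows the dstack run name automatically.
  - export WANDB_NAME=$DSTACK_RUN_NAME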
@@ -177,6 +182,8 @@ To request multiple GPUs, specify the quantity after the GPU name, separated by
- cd axolotl
- git checkout d4f6c65
- pip install -e .
- pip uninstall pynvml -y
- pip install pynvml==11.5.3
Should we add a note or at least a comment on it?
Yes. I will add.
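For instance, the pin could be annotated roughly like this (the wording is only a suggestion, based on the error this PR fixes):

- pip install -e .
# Recent pynvml releases no longer ship the pynvml.nvml module that
# this Axolotl revision imports, so pin the last compatible release.
- pip uninstall pynvml -y
- pip install pynvml==11.5.3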
@@ -177,6 +182,8 @@ To request multiple GPUs, specify the quantity after the GPU name, separated by
- cd axolotl
- git checkout d4f6c65
Why this particular revision?
Then a comment is needed, I suppose.
Yes. I will update it accordingly.
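Something short would do; the thread doesn't state why d4f6c65 was chosen, so the wording below is only a placeholder:

- cd axolotl
# Pin Axolotl to a revision tested with this example
- git checkout d4f6c65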
Please feel free to merge
Updated Axolotl NVIDIA example with Llama 4 Scout.
Resolved a module-not-found error for AMD.

Error:
ModuleNotFoundError: No module named 'pynvml.nvml'; 'pynvml' is not a package

Solution: install the previous release of pynvml:
pip install pynvml==11.5.3
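Put together, the AMD example's setup commands end up roughly as follows (the clone URL is an assumption; the rest mirrors the diff above):

- git clone https://github.com/axolotl-ai-cloud/axolotl.git
- cd axolotl
- git checkout d4f6c65
- pip install -e .
# Pin pynvml to the last release that still ships the pynvml.nvml module
- pip uninstall pynvml -y
- pip install pynvml==11.5.3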