Skip to content

[Feature] Support Tenstorrent's Wormhole accelerators #2573 #2574

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Apr 30, 2025

Conversation

peterschmidt85
Copy link
Contributor

No description provided.

@peterschmidt85 peterschmidt85 requested a review from un-def April 27, 2025 20:28
@peterschmidt85 peterschmidt85 linked an issue Apr 27, 2025 that may be closed by this pull request
@peterschmidt85
Copy link
Contributor Author

Also, depends on dstackai/gpuhunt#146

@@ -151,6 +151,14 @@ func (d *DockerRunner) restoreStateFromContainers(ctx context.Context) error {
gpuIDs = append(gpuIDs, device.PathOnHost)
}
}
case host.GpuVendorTenstorrent:
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@un-def not sure if this code is current and how to test it properly

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. Start a run
  2. Restart the shim: systemctl restart dstack-shim.service
  3. Check shim.log for the following line: locked GPU(s) due to running task

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I see

time=2025-04-30T11:05:26.154875+02:00 level=debug msg=locked GPU(s) due to running task task=a9c04738-e690-425f-9566-4fa1f4943abc gpus=[0]

Does it mean everything works, and we can merge?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, seems to work

@peterschmidt85
Copy link
Contributor Author

@un-def gpuhunt is merged so only this PR is pending

@peterschmidt85 peterschmidt85 merged commit d428e74 into master Apr 30, 2025
9 of 25 checks passed
@peterschmidt85 peterschmidt85 deleted the 2573-tenstorrents branch April 30, 2025 09:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Feature] Support Tenstorrent's Wormhole accelerators
2 participants