alexhegit/Playing-with-ROCm

Playing-with-ROCm

This repository shares my experience playing with ROCm through runnable code and step-by-step tutorials that help you reproduce what I have done. If you have an AMD iGPU or dGPU, you can try machine learning on it.
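
Before following any tutorial, it can help to check whether Python can see an AMD GPU at all. The snippet below is only a minimal sketch, not part of this repository: the helper name `rocm_status` is hypothetical, and it assumes a ROCm build of PyTorch, which exposes AMD GPUs through the `torch.cuda` API and reports its HIP version in `torch.version.hip`. If PyTorch is not installed, it only reports whether the `rocminfo` tool is on the `PATH`.

```python
import importlib.util
import shutil


def rocm_status():
    """Collect a few hints about whether a ROCm stack is usable from Python.

    Hypothetical helper for illustration; it never raises when torch is absent.
    """
    status = {
        "rocminfo_on_path": shutil.which("rocminfo") is not None,
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "torch_hip_version": None,
        "gpu_visible": False,
        "gpu_name": None,
    }
    if status["torch_installed"]:
        try:
            import torch
        except ImportError:
            # A broken install is treated the same as a missing one.
            status["torch_installed"] = False
            return status
        # torch.version.hip is None on non-ROCm builds of PyTorch.
        status["torch_hip_version"] = getattr(torch.version, "hip", None)
        # ROCm builds reuse the torch.cuda namespace for AMD GPUs.
        status["gpu_visible"] = torch.cuda.is_available()
        if status["gpu_visible"]:
            status["gpu_name"] = torch.cuda.get_device_name(0)
    return status


if __name__ == "__main__":
    for key, value in rocm_status().items():
        print(f"{key}: {value}")
```

A `gpu_visible` value of `True` together with a non-empty `torch_hip_version` suggests the GPU is reachable from PyTorch; anything else usually points to a driver, permission, or package mismatch worth checking first.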

NOTICE: To make my updates easier to track, I use 🆕 and 🔥 to flag new and hot topics.

Topics

Training

Finetuning

Inference

MLOPS with ROCm

Application/Demo


Projects that work on ROCm

These projects may not officially announce support for ROCm GPUs, but they work fine based on my verification.

| Name | URL | Category | Hands on |
| --- | --- | --- | --- |
| GLM-4-Voice | https://github.com/THUDM/GLM-4-Voice | Conversation AI | |
| EchoMimic | https://github.com/BadToBest/EchoMimic | Digital Human GenAI | Run EchoMimic with ROCm |
| Easy-Wav2Lip | https://github.com/anothermartz/Easy-Wav2Lip | Digital Human GenAI | Easy-Wav2Lip-ROCm |
| GOT-OCR2 | https://github.com/Ucas-HaoranWei/GOT-OCR2.0 | End-to-end OCR | |
| Moshi | https://github.com/kyutai-labs/moshi | Conversation AI | |
| mini-omni | https://github.com/gpt-omni/mini-omni | Conversation AI | |
| mini-omni2 | https://github.com/gpt-omni/mini-omni2 | Conversation AI | |
| Picovoice/orca | https://github.com/Picovoice/orca | Conversation AI | LLM_Voice_Assistant |
| Retrieval-based-Voice-Conversion-WebUI | https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI.git | Easily train a good VC model with voice data <= 10 mins! | |
| Freeze-Omni 🆕 🔥 | https://github.com/VITA-MLLM/Freeze-Omni | A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM | Runs in real time on Radeon W7900 with good responsiveness; feels better than Moshi and mini-omni2 |
| Step-Audio 🆕 🔥 | https://github.com/stepfun-ai/Step-Audio | Conversation AI | Model is too big; not real time |
| Step-Video-T2V 🆕 🔥 | https://github.com/stepfun-ai/Step-Video-T2V | Video GenAI | Run with 1xMI300X |
| UI-TARS | https://github.com/bytedance/UI-TARS | Automated GUI Interaction with Native Agents, from ByteDance | |
| Qwen2.5-Omni 🆕 🔥 | https://github.com/QwenLM/Qwen2.5-Omni | End-to-end multimodal model in the Qwen series | |
| CosyVoice | https://github.com/FunAudioLLM/CosyVoice | TTS LLM | tutorial, conda-env |

Wish List

| Name | URL | Category | Hands on |
| --- | --- | --- | --- |
| hertz-dev | https://github.com/Standard-Intelligence/hertz-dev | Conversation AI | |
| Freeze-Omni | https://github.com/VITA-MLLM/Freeze-Omni | Conversation AI | |
| LLaMA-Omni | https://github.com/ictnlp/LLaMA-Omni | Conversation AI | |
| ichigo Llama 3.1 | https://github.com/homebrewltd/ichigo | Conversation AI | |
| ichigo-demo | https://github.com/homebrewltd/ichigo-demo/tree/docker | | |
| Exo | https://github.com/exo-explore/exo | Heterogeneous distributed inference | |
| Perplexica | https://github.com/ItzCrazyKns/Perplexica | AI Search Engine | issue |
| MiniPerplx | https://github.com/zaidmukaddam/miniperplx | A minimalistic AI-powered search engine | |
| ollama-helm | https://github.com/otwld/ollama-helm | | |
| OpenHands | https://github.com/All-Hands-AI/OpenHands | A platform for software development agents powered by AI | |
| HayStack | https://github.com/deepset-ai/haystack | End-to-end LLM framework that allows you to build applications powered by LLMs | |
| BayLing | https://github.com/ictnlp/BayLing | | |
| Bailing | https://github.com/wwbin2017/bailing | | |
| BabelDuck | https://github.com/Orenoid/BabelDuck | Beginner-friendly AI conversation practice application | |
| KubeAI | https://github.com/substratusai/kubeai | Deploy and manage AI models on Kubernetes | |
| DSPy | https://dspy.ai | the framework for programming | |
| KServe | https://kserve.github.io/website/latest/ | | |
| Camel-ai/OWL | https://github.com/camel-ai/owl | | |
| VITA | https://github.com/VITA-MLLM/VITA | VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction | |
| DiffRhythm | https://github.com/ASLP-lab/DiffRhythm | End-to-End Full-Length Song Generation with Latent Diffusion | |
| Open-Sora | https://github.com/hpcaitech/Open-Sora | | |
| Real-Time-Voice-Cloning | https://github.com/CorentinJ/Real-Time-Voice-Cloning | | |
| OpenVoice | https://github.com/myshell-ai/OpenVoice | | |
| KrillinAI | https://github.com/krillinai/KrillinAI | | |
| RealtimeVoiceChat | https://github.com/KoljaB/RealtimeVoiceChat | | |
| pipecat | https://github.com/pipecat-ai/pipecat | | |

Tracing

Misc

MCP

3rd-stuff


@misc{playing-with-rocm,
  author = {He Ye (Alex)},
  title = {Playing with ROCm: share my experience and practice},
  howpublished = {\url{https://alexhegit.github.io/}},
  year = {2024--}
}

About

See how to play with ROCm, run it with AMD GPUs!