A beginner's guide to deploying LLMs with AMD on Windows using PyTorch
- #PyTorch
- #Generative AI
- #AMD
- AMD-optimized ONNX models available on Hugging Face for Ryzen™ AI APUs and Radeon™ GPUs.
- PyTorch for AMD on Windows and Linux is now in public preview, supporting Radeon™ RX 7000/9000 series GPUs and select Ryzen™ AI APUs.
- No dedicated AI infrastructure needed; a capable Windows PC with PyTorch and an AMD GPU suffices for LLM experimentation.
- Guide provided for setting up and running LLMs locally on Windows with AMD hardware, requiring no prior PyTorch experience.
- Supported hardware includes AMD Radeon™ AI PRO R9700, RX 7900 XTX, PRO W7900, and Ryzen™ AI Max series.
- OS requirement: Windows 11, plus the matching AMD Software driver release and Python 3.12.
- Steps include creating a virtual environment, installing PyTorch for ROCm, and running a language model like Llama 3.2 1B.
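The third step above can be sketched in Python. This is a minimal illustration, not the guide's exact script: it assumes the ROCm-enabled PyTorch preview and the `transformers` library are installed in the virtual environment, and that access to the gated `meta-llama/Llama-3.2-1B-Instruct` repository has been granted on Hugging Face. The names `pick_device` and `generate` are illustrative helpers, not part of any library.

```python
# Minimal sketch of running Llama 3.2 1B with PyTorch on an AMD GPU.
# Assumptions (not from the original guide): transformers is installed,
# and the gated Llama 3.2 1B model has been approved for your HF account.
import importlib.util

MODEL_ID = "meta-llama/Llama-3.2-1B-Instruct"  # illustrative; gated on Hugging Face


def pick_device() -> str:
    """Use the AMD GPU if PyTorch detects one, else fall back to CPU.

    ROCm builds of PyTorch expose the GPU through the `torch.cuda`
    namespace, so the usual CUDA check works unchanged on Radeon hardware.
    """
    if importlib.util.find_spec("torch") is None:
        return "cpu"
    import torch
    return "cuda" if torch.cuda.is_available() else "cpu"


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Load the model and generate a completion. Heavy: downloads weights on first run."""
    from transformers import AutoModelForCausalLM, AutoTokenizer

    device = pick_device()
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID).to(device)
    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

Because ROCm reuses the `torch.cuda` API surface, code written for NVIDIA GPUs typically runs on supported Radeon hardware without changes.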
- Interactive chatbot example provided, demonstrating conversation memory capabilities.
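The conversation-memory idea behind the chatbot example can be shown with a short sketch. This is an assumption-laden illustration, not the blog's actual code: each turn is appended to a running message list that is replayed to the model, so earlier turns stay in context. The `ask_model` callable stands in for a real generation call.

```python
# Sketch of a chatbot's conversation memory (illustrative, not the guide's code).
# `history` is a list of {"role", "content"} dicts in chat-template style;
# `ask_model` is a placeholder for a function that queries the LLM.
def chat_turn(history: list, user_message: str, ask_model) -> str:
    """Record the user turn, query the model with the full history, store the reply."""
    history.append({"role": "user", "content": user_message})
    reply = ask_model(history)  # the model sees every prior turn
    history.append({"role": "assistant", "content": reply})
    return reply


# Usage with a stub model that just reports how much context it received:
history = []
chat_turn(history, "Hello!", lambda h: f"I can see {len(h)} message(s).")
chat_turn(history, "Do you remember me?", lambda h: f"I can see {len(h)} message(s).")
```

Because the whole list is passed on every call, the second stub reply reflects three prior messages; a real chatbot would additionally trim or summarize old turns to stay within the model's context window.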
- A runtime warning about 'Memory-Efficient Attention' being unavailable is expected with the preview build and does not affect functionality.