AI Blogs

Aligning Mixtral 8x7B with TRL on AMD GPUs

This blog demonstrates how to fine-tune and align Mixtral 8x7B with TRL using DPO and evaluate it on AMD GPUs.

June 12, 2025 by Clint Greene

Introducing Instella-Long: A Fully Open Language Model with Long-Context Capability

Learn about Instella-Long: AMD’s open 3B language model supporting 128K context, trained on MI300X GPUs, outperforming peers on long-context benchmarks.

June 11, 2025 by Jialian Wu, Jiang Liu, Sudhanshu Ranjan, Xiaodong Yu, Gowtham Ramesh, Prakamya Mishra, Zicheng Liu, Yusheng Su, Ximeng Sun, Ze Wang, Emad Barsoum

AMD ROCm: Powering the World's Fastest Supercomputers

Discover how ROCm drives the world’s top supercomputers, from El Capitan to Frontier, and why it’s shaping the future of scalable, open, and sustainable HPC.

June 10, 2025 by Mohammed Faraaz Mustafa, Saad Rahim

LLM Quantization with Quark on AMD GPUs: Accuracy and Performance Evaluation

Learn how to use Quark to apply FP8 quantization to LLMs on AMD GPUs, and evaluate accuracy and performance using vLLM and SGLang on AMD MI300X GPUs.

June 09, 2025 by Sean Song

Ecosystems & Partners

The ROCm Revisited Series

We present our ROCm Revisited Series. Discover ROCm's role in leading-edge supercomputing and its growing ecosystem, from HIP to developer tools, powering AI, HPC, and data science across multi-GPU and cluster systems.

June 06, 2025 by Mohammed Faraaz Mustafa, Liam Berry, Saad Rahim

ROCm Revisited: Evolution of the High-Performance GPU Computing Ecosystem

Learn how ROCm evolved to support HPC, AI, and containerized workloads with modern tools, libraries, and deployment options.

June 06, 2025 by Liam Berry, Saad Rahim

A Step-by-Step Guide On How To Deploy Llama Stack on AMD Instinct™ GPU

Learn how to use Meta’s Llama Stack with AMD ROCm and vLLM to scale inference, integrate APIs, and streamline production-ready AI workflows on AMD Instinct™ GPUs.

April 22, 2025 by Alex He

ROCm 6.4: Breaking Barriers in AI, HPC, and Modular GPU Software

Explore ROCm 6.4's key advancements: AI/HPC performance boosts, enhanced profiling tools, better Kubernetes support and modular drivers, accelerating AI and HPC workloads on AMD GPUs.

April 11, 2025 by Jayacharan Kolla, Aditya Bhattacharji, Farshad Ghodsian, Saad Rahim, Marco Grond, Ronnie Chatterjee

Applications & Models

Reproduce AMD's MLPerf Training v5.0 Submission Result with Instinct™ GPUs

Follow this step-by-step guide to reproduce AMD's MLPerf Training v5.0 submission with Instinct GPUs using ROCm.

June 04, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Su Ann Chong, Sarthak Arora, Sathish Sanjeevi, Karan Verma, Eliot Li

AMD’s MLPerf Training Debut: Optimizing LLM Fine-Tuning with Instinct™ GPUs

Explore the techniques we used to improve the training performance on MI300X and MI325X in our MLPerf Training 5.0 submission.

June 04, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Sarthak Arora, Sathish Sanjeevi, Su Ann Chong, Karan Verma, Eliot Li

High-Throughput BERT-L Pre-Training on AMD Instinct™ GPUs: A Practical Guide

Learn how to optimize BERT-L training with mixed precision and Flash Attention v2 on AMD Instinct GPUs. Follow our tested, MLPerf-compliant step-by-step guide.

June 03, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Su Ann Chong, Sarthak Arora, Sathish Sanjeevi, Karan Verma, Eliot Li