AI - Applications & Models

Continued Pretraining: A Practical Playbook for Language-Specific LLM Adaptation

A step-by-step guide to adapting LLMs to new languages via continued pretraining, showing how Poro 2 boosts Finnish performance using Llama 3.1 and AMD GPUs.

June 18, 2025 by Elaine Zosa, Jouni Louma, Kai Hakala, Antti Virtanen, Mika Koistinen, Jonathan Burdge

Aligning Mixtral 8x7B with TRL on AMD GPUs

This blog demonstrates how to fine-tune and align Mixtral 8x7B with TRL using DPO and evaluate it on AMD GPUs.

June 12, 2025 by Clint Greene

Introducing Instella-Long: A Fully Open Language Model with Long-Context Capability

Learn about Instella-Long: AMD’s open 3B language model supporting 128K context, trained on MI300X GPUs, outperforming peers on long-context benchmarks.

June 11, 2025 by Jialian Wu, Jiang Liu, Sudhanshu Ranjan, Xiaodong Yu, Gowtham Ramesh, Prakamya Mishra, Zicheng Liu, Yusheng Su, Ximeng Sun, Ze Wang, Emad Barsoum

LLM Quantization with Quark on AMD GPUs: Accuracy and Performance Evaluation

Learn how to use Quark to apply FP8 quantization to LLMs on AMD GPUs, and evaluate accuracy and performance using vLLM and SGLang on AMD MI300X GPUs.

June 09, 2025 by Sean Song

Reproduce AMD's MLPerf Training v5.0 Submission Result with Instinct™ GPUs

Follow this step-by-step guide to reproduce AMD's MLPerf Training v5.0 submission with Instinct GPUs using ROCm.

June 04, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Su Ann Chong, Sarthak Arora, Sathish Sanjeevi, Karan Verma, Eliot Li

AMD’s MLPerf Training Debut: Optimizing LLM Fine-Tuning with Instinct™ GPUs

Explore the techniques we used to improve the training performance on MI300X and MI325X in our MLPerf Training 5.0 submission.

June 04, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Sarthak Arora, Sathish Sanjeevi, Su Ann Chong, Karan Verma, Eliot Li

High-Throughput BERT-L Pre-Training on AMD Instinct™ GPUs: A Practical Guide

Learn how to optimize BERT-L training with mixed precision and Flash Attention v2 on AMD Instinct GPUs by following our tested, MLPerf-compliant step-by-step guide.

June 03, 2025 by Meena Arunachalam, Miro Hodak, Ravi Dwivedula, Su Ann Chong, Sarthak Arora, Sathish Sanjeevi, Karan Verma, Eliot Li

Scale LLM Inference with Multi-Node Infrastructure

Learn how to horizontally scale LLM inference using open-source tools on MI300X, with vLLM, nginx, Prometheus, and Grafana.

May 30, 2025 by Jorge Parada, Eliot Li

AMD Integrates llm-d on AMD Instinct MI300X Cluster For Distributed LLM Serving

May 20, 2025 by Kenny Roche, Joe Shajrawi, Andy Luo, Anshul Gupta

Step-Video-T2V Inference with xDiT on AMD Instinct MI300X GPUs

Learn how to accelerate text-to-video generation using Step-Video-T2V, a 30B parameter T2V model, on AMD MI300X GPUs with ROCm, enabling scalable, high-fidelity video generation from text.

May 15, 2025 by Wei Cai, George Wang

DataFrame Acceleration: hipDF and hipDF.pandas on AMD GPUs

This blog post demonstrates how hipDF significantly enhances and accelerates data manipulation, aggregation, and transformation tasks on AMD hardware using ROCm.

May 07, 2025 by Fabricio Flores

CuPy and hipDF on AMD: The Basics and Beyond

Learn how to deploy CuPy and hipDF on AMD GPUs. See their high-performance computing advantages, and use CuPy and hipDF in a detailed example of an investment portfolio allocation optimization using the Markowitz model.

May 06, 2025 by Fabricio Flores
