AMD-AGI/Primus

A flexible and high-performance training framework designed for large-scale foundation model training on AMD GPUs

☆108

Alternatives and similar repositories for Primus

Users that are interested in Primus are comparing it to the libraries listed below.

AMD-AGI / Primus-Turbo
A high-performance acceleration library dedicated to large-scale model training on AMD GPUs
☆67 · Updated this week
AMD-AGI / Primus-SaFE
Primus-SaFE (Stability and Fault Endurance)
☆58 · Updated this week
AMD-AGI / maxtext-slurm
Toolkit for launching and observing MaxText training on Slurm-managed GPU clusters
☆29 · Jul 19, 2026 · Updated last week
AMD-AGI / TraceLens
Automating analysis from trace files
☆84 · Updated this week
ROCm / TransformerEngine
☆72 · Updated this week
ROCm / Megatron-LM
Ongoing research training transformer models at scale
☆43 · Updated this week
ROCm / gfx950-gluon-tutorials
A practical guide to high-performance gluon kernel development on AMD GFX9 GPUs.
☆41 · Updated this week
ROCm / aiter
AI Tensor Engine for ROCm
☆503 · Updated this week
ROCm / iris
AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming
☆193 · Updated this week
ROCm / RIXL
DEPRECATED REPOSITORY. ROCm Inference Transfer Library (RIXL) is a port of the NIXL library for AMD GPUs. See README_rocm.md for AMD spe…
☆15 · Jun 10, 2026 · Updated last month
ROCm / mori
Modular RDMA Interface
☆157 · Updated this week
HazyResearch / HipKittens
Fast and Furious AMD Kernels
☆446 · Jul 10, 2026 · Updated 2 weeks ago
carlushuang / gcnasm
amdgpu example code in hip/asm
☆66 · Updated this week
AMD-AGI / torchtitan-amd
A PyTorch native platform for training generative AI models
☆17 · Jun 30, 2026 · Updated 3 weeks ago
ROCm / MAD
MAD (Model Automation and Dashboarding)
☆39 · Updated this week
ROCm / rocSHMEM
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆146 · Updated this week
ROCm / ATOM
AiTer Optimized Model
☆144 · Updated this week
ROCm / omnistat
Scale-out system monitoring
☆25 · Updated this week
ROCm / DeepEP
☆15 · Jun 30, 2026 · Updated 3 weeks ago
ROCm / rocmProfileData
☆30 · Updated this week
ROCm / FlyDSL
FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.
☆249 · Updated this week
ByteDance-Seed / Triton-distributed
Distributed Compiler based on Triton for Parallel Systems
☆1,498 · Updated this week
AMD-AGI / GEAK
Generating Efficient AI-Centric Kernels
☆131 · Updated this week
microsoft / mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
☆542 · Updated this week
ROCm / rocm-xio
A ROCm library for GPU-Initiated IO. This provides support for initiating IO from a ROCm-capable GPU against a range of targets including…
☆53 · Jul 10, 2026 · Updated 2 weeks ago
CentML / Mist
[EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization
☆24 · Apr 13, 2026 · Updated 3 months ago
ROCm / composable_kernel
[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror
☆539 · Updated this week
graphcore-research / unit-scaling-demo
Unit Scaling demo and experimentation code
☆16 · Mar 12, 2024 · Updated 2 years ago
ROCm / rocm-libraries
super repo for rocm libraries
☆390 · Updated this week
eunomia-bpf / cupti-tutorial
Tutorials for NVIDIA CUPTI samples
☆70 · Updated this week
uccl-project / uccl
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g…
☆1,471 · Updated this week
davidhcefx / Translate-Virtual-Address-To-Physical-Address-in-Linux-Kernel
Translate Virtual Address To Physical Address in Linux Kernel
☆17 · Dec 27, 2019 · Updated 6 years ago
ROCm / rocm-blogs
☆81 · Updated this week
spcl / atlahs
ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage
☆95 · May 12, 2026 · Updated 2 months ago
ROCm / amd_matrix_instruction_calculator
A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators
☆140 · Apr 10, 2026 · Updated 3 months ago
amd-enterprise-ai / solution-blueprints
☆19 · Updated this week
ROCm / aotriton
Ahead of Time (AOT) Triton Math Library
☆100 · Updated this week
PawseySC / rocm-from-source
Scripts to build AMD ROCm from source.
☆16 · Oct 31, 2024 · Updated last year
Infrasys-AI / aiinfra-docs
☆21 · Nov 6, 2025 · Updated 8 months ago