ROCm/triton

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ROCm/triton)

ROCm / triton

Development repository for the Triton language and compiler

☆146

Alternatives and similar repositories for triton

Users that are interested in triton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ROCm / aotriton
View on GitHub
Ahead of Time (AOT) Triton Math Library
☆100Updated this week
ROCm / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆234Jul 16, 2026Updated last week
ROCm / hipBLASLt
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆114Updated this week
ROCm / composable_kernel
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror
☆539Updated this week
ROCm / rocWMMA
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆140Jul 13, 2026Updated last week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ROCm / TransformerEngine
View on GitHub
☆72Updated this week
ROCm / rocprof-trace-decoder
View on GitHub
☆17Apr 10, 2026Updated 3 months ago
ROCm / aiter
View on GitHub
AI Tensor Engine for ROCm
☆503Updated this week
ROCm / rocMLIR
View on GitHub
☆185Updated this week
ROCm / llvm-project
View on GitHub
This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…
☆225Updated this week
ROCm / hipRAND
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆27Jul 6, 2026Updated 2 weeks ago
ROCm / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆122Updated this week
CRobeck / instrument-amdgpu-kernels
View on GitHub
LLVM/MLIR based compiler instrumentation of AMD GPU kernels
☆21Jul 13, 2025Updated last year
ROCm / rocm-cmake
View on GitHub
CMake modules used within the ROCm libraries
☆77Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ROCm / AITemplate
View on GitHub
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…
☆12Jun 24, 2024Updated 2 years ago
ROCm / amd_matrix_instruction_calculator
View on GitHub
A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators
☆140Apr 10, 2026Updated 3 months ago
ROCm / pyrsmi
View on GitHub
python package of rocm-smi-lib
☆25Dec 15, 2025Updated 7 months ago
ROCm / rocprof-compute-viewer
View on GitHub
☆62Jul 16, 2026Updated last week
ROCm / FlyDSL
View on GitHub
FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.
☆249Updated this week
ROCm / iris
View on GitHub
AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming
☆193Updated this week
ROCm / rocprofiler
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆152May 28, 2026Updated last month
ROCm / rocprofiler-compute
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆165May 28, 2026Updated last month
ptillet / triton-llvm-releases
View on GitHub
☆20Oct 11, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
EmbeddedLLM / vllm
View on GitHub
vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
☆96Updated this week
ROCm / bitsandbytes
View on GitHub
8-bit CUDA functions for PyTorch
☆72Updated this week
ROCm / roctracer
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆83May 28, 2026Updated last month
ROCm / apex
View on GitHub
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
☆24Jul 1, 2026Updated 3 weeks ago
ROCm / xformers
View on GitHub
Hackable and optimized Transformers building blocks, supporting a composable construction.
☆34May 29, 2026Updated last month
seb-v / fp32_sgemm_amd
View on GitHub
Super fast FP32 matrix multiplication on RDNA3
☆92Mar 30, 2025Updated last year
ROCm / Tensile
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆260Updated this week
flagos-ai / FlagGems
View on GitHub
FlagGems is an operator library for large language models implemented in the Triton Language.
☆1,057Updated this week
ROCm / jax
View on GitHub
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
☆30Updated this week
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ROCm / AMDMIGraphX
View on GitHub
AMD's graph optimization engine.
☆318Updated this week
bertmaher / llama2.so
View on GitHub
Inference Llama 2 with a model compiled to native code by TorchInductor
☆14Feb 8, 2024Updated 2 years ago
ROCm / rocBLAS
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆396Jul 15, 2026Updated last week
cchan / tccl
View on GitHub
extensible collectives library in triton
☆97Mar 31, 2025Updated last year
ROCm / hipBLAS
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆151Updated this week
IBM / triton-dejavu
View on GitHub
Framework to reduce autotune overhead to zero for well known deployments.
☆102Sep 19, 2025Updated 10 months ago
ROCm / ROCR-Runtime
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆276Updated this week