Development repository for the Triton language and compiler
☆144Apr 30, 2026Updated this week
Alternatives and similar repositories for triton
Users that are interested in triton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast and memory-efficient exact attention☆230Apr 27, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror☆529Updated this week
- Ahead of Time (AOT) Triton Math Library☆97Apr 17, 2026Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆140Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆214Updated this week
- ☆177Updated this week
- ☆67Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆26Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆118Apr 29, 2026Updated last week
- AI Tensor Engine for ROCm☆420Updated this week
- CMake modules used within the ROCm libraries☆74Updated this week
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 2 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆166Apr 14, 2026Updated 3 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆153Apr 14, 2026Updated 3 weeks ago
- Automating analysis from trace files☆74Updated this week
- ☆20Oct 11, 2023Updated 2 years ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆24Apr 20, 2026Updated 2 weeks ago
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆188Apr 29, 2026Updated last week
- 8-bit CUDA functions for PyTorch☆72Sep 24, 2025Updated 7 months ago
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆26Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆94Apr 29, 2026Updated last week
- python package of rocm-smi-lib☆24Dec 15, 2025Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Apr 14, 2026Updated 3 weeks ago
- AMD's graph optimization engine.☆295Updated this week
- Development containers for triton and triton-cpu☆27Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆258Updated this week
- ☆53Mar 15, 2025Updated last year
- ☆54Apr 23, 2026Updated last week
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- hipDF - GPU DataFrame Library☆16Mar 16, 2026Updated last month
- Framework to reduce autotune overhead to zero for well known deployments.☆99Sep 19, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- extensible collectives library in triton☆98Mar 31, 2025Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆393Apr 29, 2026Updated last week
- FlagGems is an operator library for large language models implemented in the Triton Language.☆981Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆152Apr 28, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆270Apr 28, 2026Updated last week
- A Triton-only attention backend for vLLM☆25Mar 17, 2026Updated last month
- 8-bit CUDA functions for PyTorch Rocm compatible☆42Mar 26, 2024Updated 2 years ago