Development repository for the Triton language and compiler
☆144Apr 10, 2026Updated this week
Alternatives and similar repositories for triton
Users that are interested in triton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast and memory-efficient exact attention☆227Apr 9, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆112Apr 7, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror☆526Updated this week
- Ahead of Time (AOT) Triton Math Library☆96Apr 8, 2026Updated last week
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆209Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆139Apr 9, 2026Updated last week
- ☆173Updated this week
- ☆66Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆26Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆116Apr 9, 2026Updated last week
- AI Tensor Engine for ROCm☆402Updated this week
- CMake modules used within the ROCm libraries☆74Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆12Jun 24, 2024Updated last year
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Automating analysis from trace files☆66Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆165Mar 31, 2026Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆153Mar 31, 2026Updated 2 weeks ago
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆183Apr 9, 2026Updated last week
- ☆20Oct 11, 2023Updated 2 years ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆23Apr 3, 2026Updated last week
- 8-bit CUDA functions for PyTorch☆72Sep 24, 2025Updated 6 months ago
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆26Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Apr 9, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- python package of rocm-smi-lib☆24Dec 15, 2025Updated 4 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Mar 31, 2026Updated 2 weeks ago
- AMD's graph optimization engine.☆290Updated this week
- ☆50Apr 7, 2026Updated last week
- Development containers for triton and triton-cpu☆27Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆255Updated this week
- ☆54Mar 15, 2025Updated last year
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- Framework to reduce autotune overhead to zero for well known deployments.☆98Sep 19, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- FlagGems is an operator library for large language models implemented in the Triton Language.☆953Updated this week
- extensible collectives library in triton☆98Mar 31, 2025Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆390Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆151Apr 7, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆270Apr 8, 2026Updated last week
- A Triton-only attention backend for vLLM☆25Mar 17, 2026Updated 3 weeks ago
- 8-bit CUDA functions for PyTorch Rocm compatible☆41Mar 26, 2024Updated 2 years ago