A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
☆487Mar 10, 2025Updated last year
Alternatives and similar repositories for triton-resources
Users that are interested in triton-resources are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆453Feb 22, 2025Updated last year
- Cataloging released Triton kernels.☆308Sep 9, 2025Updated 9 months ago
- My submission for the GPUMODE/AMD fp8 mm challenge☆29Jun 4, 2025Updated last year
- Puzzles for learning Triton☆2,491Apr 1, 2026Updated 2 months ago
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.☆359Updated this week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- GPU Kernels☆225Apr 27, 2025Updated last year
- Triton Compiler related materials.☆44Mar 16, 2026Updated 3 months ago
- Learnings and programs related to CUDA☆438Jun 29, 2025Updated 11 months ago
- Learn CUDA with PyTorch☆330Jun 1, 2026Updated 2 weeks ago
- 100 days of building GPU kernels!☆602Apr 27, 2025Updated last year
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆924Mar 29, 2025Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆204Jun 1, 2025Updated last year
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- ☆32Jul 2, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆603May 13, 2026Updated last month
- Fast low-bit matmul kernels in Triton☆471May 15, 2026Updated last month
- EquiTriton is a project that seeks to implement high-performance kernels for commonly used building blocks in equivariant neural networks…☆72May 25, 2026Updated 3 weeks ago
- A bunch of kernels that might make stuff slower 😉☆90Jun 8, 2026Updated last week
- ☆430Apr 10, 2025Updated last year
- GPU programming related news and material links☆2,174Updated this week
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated last year
- Automatic differentiation for Triton Kernels☆29Aug 12, 2025Updated 10 months ago
- Write a fast kernel and see how you compare against the best humans and AI on gpumode.com☆99Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Efficient Triton Kernels for LLM Training☆6,444Updated this week
- Collection of kernels written in Triton language☆199Jan 27, 2026Updated 4 months ago
- ☆336Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,220Aug 26, 2025Updated 9 months ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆111Jun 28, 2025Updated 11 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆885Updated this week
- 🚀 Efficient implementations for emerging model architectures☆5,227Jun 11, 2026Updated last week
- Distributed Compiler based on Triton for Parallel Systems☆1,459Apr 22, 2026Updated last month
- Build compute kernels and load them from the Hub.☆691Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- coding CUDA everyday!☆77Feb 5, 2026Updated 4 months ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆261May 6, 2025Updated last year
- making the official triton tutorials actually comprehensible☆176May 10, 2026Updated last month
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Mar 24, 2025Updated last year
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆6,489Updated this week
- learningggggggg 🐳☆622Apr 2, 2025Updated last year
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard!☆267Updated this week