A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
☆467 · Mar 10, 2025 · Updated last year
Alternatives and similar repositories for triton-resources
Users who are interested in triton-resources are comparing it to the libraries listed below.
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast… ☆440 · Feb 22, 2025 · Updated last year
- Cataloging released Triton kernels. ☆300 · Sep 9, 2025 · Updated 6 months ago
- My submission for the GPUMODE/AMD fp8 mm challenge ☆29 · Jun 4, 2025 · Updated 9 months ago
- Puzzles for learning Triton ☆2,348 · Mar 18, 2026 · Updated last week
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance. ☆335 · Updated this week
- GPU Kernels ☆223 · Apr 27, 2025 · Updated 11 months ago
- Triton Compiler related materials. ☆42 · Mar 16, 2026 · Updated 2 weeks ago
- Learnings and programs related to CUDA ☆435 · Jun 29, 2025 · Updated 9 months ago
- 100 days of building GPU kernels! ☆581 · Apr 27, 2025 · Updated 11 months ago
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc… ☆878 · Mar 29, 2025 · Updated last year
- Learn CUDA with PyTorch ☆257 · Updated this week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆201 · Jun 1, 2025 · Updated 9 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆18 · Feb 9, 2026 · Updated last month
- ☆32 · Jul 2, 2025 · Updated 8 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆598 · Aug 12, 2025 · Updated 7 months ago
- Fast low-bit matmul kernels in Triton ☆438 · Feb 1, 2026 · Updated last month
- EquiTriton is a project that seeks to implement high-performance kernels for commonly used building blocks in equivariant neural networks… ☆69 · Mar 9, 2026 · Updated 3 weeks ago
- A bunch of kernels that might make stuff slower 😉 ☆85 · Updated this week
- ☆420 · Apr 10, 2025 · Updated 11 months ago
- DeeperGEMM: crazy optimized version ☆75 · May 5, 2025 · Updated 10 months ago
- Write a fast kernel and see how you compare against the best humans and AI on gpumode.com ☆90 · Updated this week
- GPU programming related news and material links ☆2,060 · Mar 8, 2026 · Updated 3 weeks ago
- Automatic differentiation for Triton Kernels ☆29 · Aug 12, 2025 · Updated 7 months ago
- Collection of kernels written in Triton language ☆185 · Jan 27, 2026 · Updated 2 months ago
- ☆310 · Mar 22, 2026 · Updated last week
- Efficient Triton Kernels for LLM Training ☆6,242 · Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆2,119 · Aug 26, 2025 · Updated 7 months ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing. ☆106 · Jun 28, 2025 · Updated 9 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate. ☆809 · Mar 23, 2026 · Updated last week
- Build compute kernels and load them from the Hub. ☆536 · Updated this week
- 🚀 Efficient implementations of state-of-the-art linear attention models ☆4,692 · Updated this week
- making the official triton tutorials actually comprehensible ☆140 · Aug 25, 2025 · Updated 7 months ago
- Distributed Compiler based on Triton for Parallel Systems ☆1,398 · Mar 11, 2026 · Updated 2 weeks ago
- coding CUDA everyday! ☆74 · Feb 5, 2026 · Updated last month
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS ☆255 · May 6, 2025 · Updated 10 months ago
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels ☆5,432 · Updated this week
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7) ☆66 · Mar 24, 2025 · Updated last year
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard! ☆240 · Mar 23, 2026 · Updated last week
- learningggggggg 🐳 ☆615 · Apr 2, 2025 · Updated 11 months ago