Jokeren/triton-samples

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Jokeren/triton-samples)

Jokeren / triton-samples

☆29

Alternatives and similar repositories for triton-samples

Users that are interested in triton-samples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Deep-Learning-Profiling-Tools / triton-samples
View on GitHub
☆14Mar 8, 2025Updated last year
facebookexperimental / protoquant
View on GitHub
Prototype routines for GPU quantization written using PyTorch.
☆21Apr 15, 2026Updated 3 months ago
haileyschoelkopf / triton-index
View on GitHub
See https://github.com/cuda-mode/triton-index/ instead!
☆11May 8, 2024Updated 2 years ago
daemyung / practice-triton
View on GitHub
삼각형의 실전! Triton
☆16Feb 15, 2024Updated 2 years ago
Kernel-Machines / kermac
View on GitHub
Pytorch routines for (Ker)nel (Mac)hines
☆12Oct 10, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Deep-Learning-Profiling-Tools / triton-viz
View on GitHub
☆350Jul 16, 2026Updated last week
UmerHA / triton_util
View on GitHub
Make triton easier
☆49Jun 12, 2024Updated 2 years ago
mlc-ai / package
View on GitHub
☆14Updated this week
sukoncon / TMA-Adaptive-FP8-Grouped-GEMM
View on GitHub
☆27Aug 28, 2025Updated 10 months ago
FindHao / drgpu
View on GitHub
A Top-Down Profiler for GPU Applications
☆23Feb 29, 2024Updated 2 years ago
sdpython / onnxcustom
View on GitHub
Tutorial on how to convert machine learned models into ONNX
☆14Mar 11, 2023Updated 3 years ago
meta-pytorch / FACTO
View on GitHub
Framework for Algorithmic Correctness Testing of Operators
☆16Mar 9, 2026Updated 4 months ago
mikex86 / tritonc
View on GitHub
Standalone commandline CLI tool for compiling Triton kernels
☆20Sep 13, 2024Updated last year
meta-pytorch / MSLK
View on GitHub
MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI tr…
☆121Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
thuml / learn_torch.compile
View on GitHub
torch.compile artifacts for common deep learning models, can be used as a learning resource for torch.compile
☆19Dec 22, 2023Updated 2 years ago
thecharlieblake / lovely-llama
View on GitHub
An implementation of the Llama architecture, to instruct and delight
☆21May 31, 2025Updated last year
ai-ar-research / Lemur-program-verification
View on GitHub
A verifier that integrates LLMs into automated C program verification
☆16Apr 4, 2026Updated 3 months ago
AkiraHakuta / antlr4_Cpp_examples
View on GitHub
Some examples of the Cpp target.
☆10Nov 7, 2023Updated 2 years ago
k4m4 / hex-cli
View on GitHub
Hex encode & decode a string, right from your terminal.
☆10Jan 5, 2023Updated 3 years ago
IaroslavElistratov / triton-autodiff
View on GitHub
☆19Nov 11, 2025Updated 8 months ago
kyouko-taiga / Diesel
View on GitHub
A Swift library to write parsers for domain specific languages.
☆15Nov 3, 2020Updated 5 years ago
mlampros / nmslibR
View on GitHub
Non Metric Space ( Approximate ) Library in R
☆12Feb 2, 2023Updated 3 years ago
alexzhang13 / Triton-Puzzles-Solutions
View on GitHub
Personal solutions to the Triton Puzzles
☆21Jul 18, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lewislepton / raccoon
View on GitHub
raccoon engine for kha [formerly lkl]
☆12Sep 2, 2019Updated 6 years ago
sgugger / torchdynamo-tests
View on GitHub
☆20Nov 23, 2022Updated 3 years ago
TheJDen / janestreet-gpu-mode-2025
View on GitHub
An educational walkthrough for the 9/6 Hackathon
☆18Mar 5, 2026Updated 4 months ago
yifuwang / symm-mem-recipes
View on GitHub
☆170Dec 27, 2024Updated last year
meta-pytorch / kraken
View on GitHub
Triton-based Symmetric Memory operators and examples
☆106May 15, 2026Updated 2 months ago
jax-ml / jax-triton
View on GitHub
jax-triton contains integrations between JAX and OpenAI Triton
☆465Updated this week
ekondis / gpuroofperf-toolkit
View on GitHub
A GPU performance prediction toolkit for CUDA programs
☆18Mar 25, 2019Updated 7 years ago
Samsung / veles.simd
View on GitHub
Distributed machine learning platform
☆13Aug 20, 2015Updated 10 years ago
open-lm-engine / accelerated-model-architectures
View on GitHub
☆91Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
lianakoleva / no-libtorch-compile
View on GitHub
☆21Mar 3, 2025Updated last year
Wren6991 / DOOMSoC
View on GitHub
A SoC for DOOM
☆20Apr 11, 2021Updated 5 years ago
xnning / dependent-types-in-haskell
View on GitHub
Demo code for the talk Dependent Types in Haskell in Hong Kong Functional Programming meetup
☆16Dec 13, 2018Updated 7 years ago
91varunsharma / Multilevel-Cache-Simulator
View on GitHub
Implemented a two-level (L1 and L2) cache simulator in C++ with round robin eviction policy
☆10Jan 4, 2017Updated 9 years ago
zslwyuan / Zynq_HLS_DDR_Dataflow_kernel_2mm
View on GitHub
This is a project integrating HLS IP and CortexA9 on Zynq. This CPU-FPGA project, for a Matrix Multiplication Dataflow, is implemented wi…
☆21Sep 3, 2019Updated 6 years ago
GeeeekExplorer / 3d-parallel-demo
View on GitHub
使用torch.distributed实现DP/TP/PP
☆15Dec 28, 2023Updated 2 years ago
PerfVec / PerfVec
View on GitHub
A generalizable machine learning-based performance modeling framework.
☆20Jun 9, 2025Updated last year