Jaykef/Triton-nanoGPT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Jaykef/Triton-nanoGPT)

Jaykef / Triton-nanoGPT

Custom triton kernels for training Karpathy's nanoGPT.

☆19

Alternatives and similar repositories for Triton-nanoGPT

Users that are interested in Triton-nanoGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dayal-kalra / low-memory-adam
View on GitHub
☆14Mar 2, 2025Updated last year
herrmann / rustorch
View on GitHub
"PyTorch in Rust"
☆17Feb 13, 2024Updated 2 years ago
justindomke / numbat
View on GitHub
NumPy+Jax with named axes and an uncompromising attitude
☆23Mar 4, 2025Updated last year
mingyin0312 / RL4LLM
View on GitHub
RL significantly the reasoning capability of Qwen2.5-1.5B-Instruct
☆31Feb 23, 2025Updated last year
kvfrans / matrix-whitening
View on GitHub
Code for "What really matters in matrix-whitening optimizers?"
☆25Oct 31, 2025Updated 8 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
inferless / deepseek-r1-distill-qwen-32b
View on GitHub
A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gp…
☆16Mar 11, 2025Updated last year
naklecha / llm-inference-optimizations-explained
View on GitHub
in this repository, i'm going to implement increasingly complex llm inference optimizations
☆86May 22, 2025Updated last year
ALucek / GRPO-Training
View on GitHub
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆38May 18, 2025Updated last year
Dao-AILab / gemm-cublas
View on GitHub
☆22May 5, 2025Updated last year
stmilab / ArterialNet
View on GitHub
ArterialNet reconstructs arterial blood pressure (ABP) waveform
☆14Feb 24, 2025Updated last year
PGomes92 / kubios
View on GitHub
Python package to export NN/RR interval series in KUBIOS HRV readable format and to import HRV results from KUBIOS report files in .txt f…
☆12Jan 4, 2019Updated 7 years ago
zhaoyanpeng / cpcfg
View on GitHub
Fast and Modularized CFG-focused Models
☆23Nov 8, 2023Updated 2 years ago
j-towns / scanagram
View on GitHub
Tidy autoregressive inference in JAX
☆15Sep 1, 2025Updated 10 months ago
GSYfate / knnlm-limits
View on GitHub
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆24Apr 30, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Cadenza-Labs / sleeper-agents
View on GitHub
☆15Jul 12, 2024Updated 2 years ago
lucidrains / ttt-rl
View on GitHub
Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez
☆15Apr 2, 2025Updated last year
brendano / myutil
View on GitHub
☆23Dec 15, 2020Updated 5 years ago
vvvm23 / mamba-jax
View on GitHub
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
☆94Jan 25, 2024Updated 2 years ago
lucidrains / simplicial-attention
View on GitHub
Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…
☆49Sep 2, 2025Updated 10 months ago
sdan / nanoEBM
View on GitHub
minimal Energy-based transformer
☆44Dec 11, 2025Updated 7 months ago
FLAIROx / jafar
View on GitHub
JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"
☆107Jan 23, 2025Updated last year
HuyNguyen-hust / hopper-gemm-101
View on GitHub
☆14Dec 22, 2024Updated last year
BobMcDear / attorch
View on GitHub
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
☆605May 13, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
JonasGeiping / dataaugs
View on GitHub
☆18Oct 12, 2022Updated 3 years ago
apple / ml-np-rasp
View on GitHub
☆22Jan 19, 2024Updated 2 years ago
sdslabs / bruter
View on GitHub
brute but stronger
☆11Jun 25, 2026Updated last month
srush / Hierarchical-Bayes-Compiler
View on GitHub
Hal Daume's hbc
☆20Jan 23, 2010Updated 16 years ago
itsdaniele / speculative_mamba
View on GitHub
☆18Nov 28, 2024Updated last year
soheil-mp / Reinforcement-Learning-Algorithms
View on GitHub
Step by Step Reinforcement Learning Tutorials.
☆12Nov 19, 2022Updated 3 years ago
Ping-C / optimizer
View on GitHub
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆39Mar 2, 2023Updated 3 years ago
HarleyCoops / smolThinker-.5B
View on GitHub
A Qwen .5B reasoning model trained on OpenR1-Math-220k
☆14Jul 22, 2026Updated last week
ai-wand / concise-reasoning
View on GitHub
Concise Reasoning via Reinforcement Learning
☆13Apr 16, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ZhaofengWu / SIFT
View on GitHub
☆14Aug 3, 2022Updated 3 years ago
cocoa-org / NanoRollout
View on GitHub
Scale digital agent rollouts without pain.
☆34Jun 18, 2026Updated last month
AI-Hypercomputer / cloud-diagnostics-xprof
View on GitHub
☆17Jan 23, 2026Updated 6 months ago
changjonathanc / flex-nano-vllm
View on GitHub
FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.
☆356Nov 2, 2025Updated 8 months ago
young-geng / mintext
View on GitHub
Minimal but scalable implementation of large language models in JAX
☆34Nov 28, 2025Updated 8 months ago
FLAIROx / popjym
View on GitHub
POPGym Library in JAX
☆14Apr 15, 2024Updated 2 years ago
mayank31398 / ladder-residual-inference
View on GitHub
☆14Jul 13, 2025Updated last year