EurekaLabsAI / microgradLinks

The Autograd Engine

☆620

Alternatives and similar repositories for micrograd

Users that are interested in micrograd are comparing it to the libraries listed below

Sorting:

EurekaLabsAI / mlp
The Multilayer Perceptron Language Model
☆554Updated 11 months ago
EurekaLabsAI / tensor
The Tensor (or Array)
☆438Updated 11 months ago
EurekaLabsAI / ngram
The n-gram Language Model
☆1,437Updated 11 months ago
clu0 / unet.cu
UNet diffusion model in pure CUDA
☆612Updated last year
karpathy / nano-llama31
nanoGPT style version of Llama 3.1
☆1,401Updated 11 months ago
ulrichstern / cuda-convnet
Alex Krizhevsky's original code from Google Code
☆194Updated 9 years ago
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆1,607Updated 2 weeks ago
ash-01xor / bpe.c
Simple Byte pair Encoding mechanism used for tokenization process . written purely in C
☆134Updated 8 months ago
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆808Updated last week
mesozoic-egg / tinygrad-notes
Tutorials on tinygrad
☆394Updated last month
smolorg / smolgrad
small auto-grad engine inspired from Karpathy's micrograd and PyTorch
☆274Updated 8 months ago
Maharshi-Pandya / cudacodes
Learnings and programs related to CUDA
☆412Updated 3 weeks ago
MarioSieg / magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
☆553Updated this week
policy-gradient / GRPO-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
☆1,488Updated 3 months ago
KellerJordan / modded-nanogpt
NanoGPT (124M) in 3 minutes
☆2,851Updated last week
rkinas / triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
☆380Updated 4 months ago
rwitten / HighPerfLLMs2024
☆514Updated last year
Laz4rz / GPT-2
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
☆172Updated 11 months ago
gpu-mode / awesomeMLSys
An ML Systems Onboarding list
☆845Updated 6 months ago
LambdaLabsML / distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
☆458Updated 5 months ago
Quentin-Anthony / nanoMPI
Simple MPI implementation for prototyping or learning
☆267Updated this week
kvfrans / jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
☆279Updated last year
arpitingle / gpu-alpha
High Quality Resources on GPU Programming/Architecture
☆589Updated 11 months ago
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆897Updated 2 months ago
lucasdelimanogueira / PyNorch
Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)
☆150Updated last year
neobundy / Deep-Dive-Into-AI-With-MLX-PyTorch
"Deep Dive into AI with MLX and PyTorch" is an educational initiative designed to help anyone interested in AI, specifically in machine l…
☆483Updated 2 months ago
bkitano / llama-from-scratch
Llama from scratch, or How to implement a paper without crying
☆572Updated last year
hkproj / 100-days-of-gpu
☆352Updated 3 months ago
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆159Updated last month
mlops-discord / gpu-optimization-workshop
Slides, notes, and materials for the workshop
☆327Updated last year