joey00072 / Tinytorch
A really tiny autograd engine
☆89Updated 10 months ago
Alternatives and similar repositories for Tinytorch:
Users that are interested in Tinytorch are comparing it to the libraries listed below
- Simple Transformer in Jax☆136Updated 7 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated last year
- ☆141Updated last year
- Solve puzzles. Learn CUDA.☆62Updated last year
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆167Updated 6 months ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆51Updated 10 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆169Updated last year
- Alex Krizhevsky's original code from Google Code☆189Updated 8 years ago
- Collection of autoregressive model implementation☆81Updated this week
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆117Updated last year
- Andrej Kapathy's micrograd implemented in c☆29Updated 6 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 5 months ago
- ☆98Updated 9 months ago
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge)☆46Updated 8 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆94Updated 2 months ago
- look how they massacred my boy☆63Updated 3 months ago
- An introduction to LLM Sampling☆75Updated 2 months ago
- ☆75Updated 7 months ago
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.☆129Updated 6 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆121Updated 9 months ago
- σ-GPT: A New Approach to Autoregressive Models☆61Updated 6 months ago
- ☆49Updated 11 months ago
- working implimention of deepseek MLA☆29Updated last month
- A puzzle to learn about prompting☆124Updated last year
- supporting pytorch FSDP for optimizers☆76Updated 2 months ago
- ☆86Updated 11 months ago
- Normalized Transformer (nGPT)☆150Updated 2 months ago
- pytorch from scratch in pure C/CUDA and python☆39Updated 4 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆63Updated 3 months ago
- Stream of my favorite papers and links☆40Updated 5 months ago