joey00072 / Tinytorch
A really tiny autograd engine
☆90 Updated 11 months ago
Alternatives and similar repositories for Tinytorch:
Users who are interested in Tinytorch are comparing it to the libraries listed below.
- Simple Transformer in JAX ☆136 Updated 9 months ago
- Andrej Karpathy's micrograd implemented in C ☆28 Updated 7 months ago
- Solve puzzles. Learn CUDA. ☆63 Updated last year
- Collection of autoregressive model implementations ☆83 Updated last month
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip. ☆44 Updated last year
- Small scale distributed training of sequential deep learning models, built on NumPy and MPI. ☆126 Updated last year
- ☆23 Updated 7 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish ☆170 Updated 7 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel ☆173 Updated last year
- Fast bare-bones BPE for modern tokenizer training ☆149 Updated 5 months ago
- ☆98 Updated 11 months ago
- Learning about CUDA by writing PTX code. ☆124 Updated last year
- Alex Krizhevsky's original code from Google Code ☆190 Updated 9 years ago
- inference code for mixtral-8x7b-32kseqlen ☆99 Updated last year
- ☆76 Updated 8 months ago
- look how they massacred my boy ☆63 Updated 5 months ago
- run paligemma in real time ☆131 Updated 10 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 6 months ago
- Simplex Random Feature attention, in PyTorch ☆74 Updated last year
- ☆27 Updated 8 months ago
- a highly efficient compression algorithm for the N1 implant (Neuralink's compression challenge) ☆46 Updated 9 months ago
- Simple byte pair encoding mechanism used for tokenization, written purely in C ☆129 Updated 4 months ago
- ☆149 Updated last year
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po… ☆87 Updated last year
- ☆49 Updated last year
- ☆60 Updated last year
- Comprehensive analysis of differences in performance of QLoRA, LoRA, and full finetunes. ☆82 Updated last year