apoorvnandan / lilgrad
PyTorch from scratch in pure C/CUDA and Python
☆37 · Updated last month
Related projects
Alternatives and complementary repositories for lilgrad
- LLM training in simple, raw C/CUDA ☆87 · Updated 6 months ago
- High-Performance FP32 Matrix Multiplication on CPU ☆301 · Updated last week
- parallelized hyperdimensional tictactoe ☆110 · Updated 2 months ago
- my little linear algebra library ☆45 · Updated 4 months ago
- Andrej Karpathy's micrograd implemented in C ☆29 · Updated 3 months ago
- ☆47 · Updated 3 months ago
- Solve puzzles to improve your tinygrad skills! ☆87 · Updated 2 months ago
- A really tiny autograd engine ☆87 · Updated 7 months ago
- Simple Byte Pair Encoding mechanism used for the tokenization process, written purely in C ☆120 · Updated last week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, in pure C. ☆21 · Updated 4 months ago
- Nvidia Instruction Set Specification Generator ☆215 · Updated 4 months ago
- machine learning from absolute scratch in C: gradients, linear algebra ops, and everything else without using any third-party library! ☆21 · Updated 3 months ago
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa… ☆170 · Updated last month
- small autograd engine inspired by Karpathy's micrograd and PyTorch ☆179 · Updated this week
- Tensor library with autograd using only Rust's standard library ☆62 · Updated 4 months ago
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1. ☆117 · Updated 3 months ago
- minimal diffusion transformer in PyTorch. ☆15 · Updated last month
- Alex Krizhevsky's original code from Google Code ☆190 · Updated 8 years ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments because I have the memory of a goldfish ☆167 · Updated 3 months ago
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.) ☆51 · Updated 3 months ago
- An MNIST neural network written from scratch in Odin, visualised with Raylib ☆161 · Updated last month
- Neural network in C for recognizing American Sign Language (ASL) from scratch on the MNIST dataset. Optimized with parallel training. Cann… ☆38 · Updated 2 months ago
- Small scale distributed training of sequential deep learning models, built on NumPy and MPI. ☆107 · Updated last year
- Convoluting η-dimensional tensors over abstract manifolds. ☆55 · Updated last week
- creating a tiny tensor library in raw C ☆544 · Updated last month
- could we make an ML stack in 100,000 lines of code? ☆26 · Updated 4 months ago
- Because tinygrad got out of hand with line count ☆146 · Updated last month
- a tiny multidimensional array implementation in C similar to NumPy, but only one file. ☆221 · Updated 3 months ago
- The Tensor (or Array) ☆411 · Updated 3 months ago
- An implementation of delta-iris in tinygrad ☆71 · Updated 3 months ago