geohot / body_loopLinks
comma body does a loop around the office
☆27Updated last year
Alternatives and similar repositories for body_loop
Users that are interested in body_loop are comparing it to the libraries listed below
Sorting:
- parallelized hyperdimensional tictactoe☆125Updated last year
- Scripts and environment for the tinybox☆94Updated last year
- Can RL solve simple problems?☆54Updated last year
- An implementation of delta-iris in tinygrad☆72Updated last year
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- If tinygrad wasn't small enough for you...☆743Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated 2 years ago
- Alex Krizhevsky's original code from Google Code☆199Updated 9 years ago
- Tutorials on tinygrad☆431Updated 2 weeks ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi …☆350Updated last year
- commaVQ is a dataset of compressed driving video☆330Updated last month
- nice and effective super simple calorie counter web app☆101Updated last year
- A really tiny autograd engine☆95Updated 5 months ago
- ☆94Updated last week
- Noob Lessons from Stream about how GPUs work☆130Updated 6 months ago
- The Tensor (or Array)☆451Updated last year
- Solve puzzles to improve your tinygrad skills!☆145Updated 2 weeks ago
- could we make an ml stack in 100,000 lines of code?☆46Updated last year
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in C☆137Updated 11 months ago
- Learning about CUDA by writing PTX code.☆145Updated last year
- Simple Transformer in Jax☆139Updated last year
- Letting computers listen to you and really care☆371Updated 3 years ago
- Fast bare-bones BPE for modern tokenizer training☆167Updated 4 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆191Updated 2 years ago
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆63Updated last year
- ☆448Updated 6 months ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆367Updated 6 months ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆52Updated last year
- Gradient descent is cool and all, but what if we could delete it?☆104Updated 2 months ago
- Quantized LLM training in pure CUDA/C++.☆209Updated this week