geohotstan / tinycorp-meetings
☆94 Updated this week
Alternatives and similar repositories for tinycorp-meetings
Users that are interested in tinycorp-meetings are comparing it to the libraries listed below
- Solve puzzles to improve your tinygrad skills! ☆145 Updated 2 weeks ago
- Tutorials on tinygrad ☆431 Updated 2 weeks ago
- parallelized hyperdimensional tictactoe ☆125 Updated last year
- could we make an ml stack in 100,000 lines of code? ☆46 Updated last year
- Noob Lessons from Stream about how GPUs work ☆129 Updated 6 months ago
- The Tensor (or Array) ☆451 Updated last year
- If tinygrad wasn't small enough for you... ☆743 Updated last year
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch ☆276 Updated 11 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish ☆172 Updated last year
- An implementation of delta-iris in tinygrad ☆72 Updated last year
- Simple Transformer in Jax ☆139 Updated last year
- Tensor library with autograd using only Rust's standard library ☆70 Updated last year
- Quantized LLM training in pure CUDA/C++. ☆206 Updated last week
- A really tiny autograd engine ☆95 Updated 5 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL ☆130 Updated last year
- Learning about CUDA by writing PTX code. ☆145 Updated last year
- Gradient descent is cool and all, but what if we could delete it? ☆104 Updated 2 months ago
- The simplest way to run LLMs anywhere ☆106 Updated last year
- (WIP) A small but powerful, homemade PyTorch from scratch. ☆643 Updated last week
- SIMD quantization kernels ☆89 Updated last month
- ☆42 Updated 2 weeks ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel ☆191 Updated 2 years ago
- ☆96 Updated last year
- High Quality Resources on GPU Programming/Architecture ☆589 Updated last year
- Can you design a controller to steer a simulated car? ☆303 Updated 3 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI. ☆146 Updated 2 years ago
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge) ☆46 Updated last year
- Simple byte pair encoding mechanism used for the tokenization process. Written purely in C ☆137 Updated 11 months ago
- Solve puzzles. Learn CUDA. ☆64 Updated last year
- ☆448 Updated 6 months ago