geohotstan / tinycorp-meetingsLinks
☆87Updated last week
Alternatives and similar repositories for tinycorp-meetings
Users that are interested in tinycorp-meetings are comparing it to the libraries listed below
Sorting:
- Solve puzzles to improve your tinygrad skills!☆135Updated 4 months ago
- Tutorials on tinygrad☆391Updated 3 weeks ago
- parallelized hyperdimensional tictactoe☆118Updated 10 months ago
- could we make an ml stack in 100,000 lines of code?☆46Updated 11 months ago
- Noob Lessons from Stream about how GPUs work☆123Updated 2 months ago
- An implementation of delta-iris in tinygrad☆72Updated 10 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆272Updated 7 months ago
- Learning about CUDA by writing PTX code.☆133Updated last year
- Simple Transformer in Jax☆138Updated last year
- The Tensor (or Array)☆437Updated 11 months ago
- Gradient descent is cool and all, but what if we could delete it?☆104Updated this week
- SIMD quantization kernels☆73Updated last week
- Can RL solve simple problems?☆54Updated last year
- The simplest way to run LLMs anywhere☆105Updated 8 months ago
- ☆448Updated 3 months ago
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge)☆47Updated last year
- Alex Krizhevsky's original code from Google Code☆194Updated 9 years ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆173Updated 11 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆188Updated last year
- (WIP) A small but powerful, homemade PyTorch from scratch.☆555Updated last week
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 3 months ago
- Tensor library with autograd using only Rust's standard library☆68Updated last year
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆62Updated last year
- This repository contain the simple llama3 implementation in pure jax.☆67Updated 4 months ago
- Learnings and programs related to CUDA☆411Updated 2 weeks ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆134Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆96Updated last month
- Can you design a controller to steer a simulated car?☆257Updated 6 months ago
- Nvidia Instruction Set Specification Generator☆280Updated last year