mesozoic-egg / tinygrad-notesLinks

Tutorials on tinygrad

☆396

Alternatives and similar repositories for tinygrad-notes

Users that are interested in tinygrad-notes are comparing it to the libraries listed below

Sorting:

obadakhalili / tinygrad-tensor-puzzles
Solve puzzles to improve your tinygrad skills!
☆140Updated 4 months ago
tinygrad / teenygrad
If tinygrad wasn't small enough for you...
☆725Updated last year
geohotstan / tinycorp-meetings
☆87Updated last week
arpitingle / gpu-alpha
High Quality Resources on GPU Programming/Architecture
☆588Updated last year
EurekaLabsAI / tensor
The Tensor (or Array)
☆441Updated 11 months ago
spikedoanz / tensor-tic-tac-toe
parallelized hyperdimensional tictactoe
☆118Updated 11 months ago
spikedoanz / from-bits-to-intelligence
could we make an ml stack in 100,000 lines of code?
☆46Updated last year
commaai / controls_challenge
Can you design a controller to steer a simulated car?
☆267Updated last week
smolorg / smolgrad
small auto-grad engine inspired from Karpathy's micrograd and PyTorch
☆274Updated 8 months ago
clu0 / unet.cu
UNet diffusion model in pure CUDA
☆613Updated last year
linjames0 / Transformer-CUDA
An implementation of the transformer architecture onto an Nvidia CUDA kernel
☆189Updated last year
Quentin-Anthony / nanoMPI
Simple MPI implementation for prototyping or learning
☆272Updated 2 weeks ago
Laz4rz / GPT-2
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
☆172Updated last year
MarioSieg / magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
☆554Updated this week
siboehm / ShallowSpeed
Small scale distributed training of sequential deep learning models, built on Numpy and MPI.
☆137Updated last year
unixpickle / learn-ptx
Learning about CUDA by writing PTX code.
☆133Updated last year
ulrichstern / cuda-convnet
Alex Krizhevsky's original code from Google Code
☆195Updated 9 years ago
salykova / sgemm.c
Multi-Threaded FP32 Matrix Multiplication on x86 CPUs
☆350Updated 3 months ago
tugot17 / pmpp
Complete solutions to the Programming Massively Parallel Processors Edition 4
☆444Updated last month
Maharshi-Pandya / cudacodes
Learnings and programs related to CUDA
☆414Updated last month
jla524 / fromthetensor
From the Tensor to Stable Diffusion, a rough outline for a 1 week course.
☆1,067Updated last week
Quentin-Anthony / torch-profiling-tutorial
☆447Updated 2 weeks ago
EurekaLabsAI / micrograd
The Autograd Engine
☆623Updated 10 months ago
smolorg / smolar
a tiny multidimensional array implementation in C similar to numpy, but only one file.
☆228Updated last year
tinygrad / gpuctypes
ctypes wrappers for HIP, CUDA, and OpenCL
☆130Updated last year
tinygrad / 7900xtx
☆449Updated 4 months ago
srush / Transformer-Puzzles
Puzzles for exploring transformers
☆356Updated 2 years ago
srush / Autodiff-Puzzles
☆443Updated 9 months ago
rkinas / triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
☆383Updated 4 months ago
ash-01xor / bpe.c
Simple Byte pair Encoding mechanism used for tokenization process . written purely in C
☆134Updated 8 months ago