modular / mojo-gpu-puzzlesLinks
Learn GPU Programming in Mojo🔥 by Solving Puzzles
☆107Updated this week
Alternatives and similar repositories for mojo-gpu-puzzles
Users that are interested in mojo-gpu-puzzles are comparing it to the libraries listed below
Sorting:
- Composable Function Transformations in Python with Mojo/MAX acceleration☆270Updated this week
- Write a fast kernel and run it on Discord. See how you compare against the best!☆50Updated this week
- port of Andrjey Karpathy's llm.c to Mojo☆354Updated this week
- Where GPUs get cooked 👩🍳🔥☆274Updated this week
- A Learning Journey: Micrograd in Mojo 🔥☆62Updated 9 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆137Updated last year
- Tensor library with autograd using only Rust's standard library☆68Updated last year
- ☆28Updated 11 months ago
- A Machine Learning framework from scratch in Pure Mojo 🔥☆443Updated 6 months ago
- PyTorch Single Controller☆345Updated this week
- Learning about CUDA by writing PTX code.☆133Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆99Updated 3 weeks ago
- Simple MPI implementation for prototyping or learning☆275Updated this week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆284Updated last week
- The Tensor (or Array)☆441Updated 11 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 4 months ago
- ☆13Updated last month
- A working machine learning framework in pure Mojo 🔥☆130Updated last year
- SIMD quantization kernels☆78Updated this week
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆466Updated this week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆189Updated 2 months ago
- ☆54Updated last week
- Machine Learning algorithms in pure Mojo 🔥☆38Updated 2 weeks ago
- High-Performance SGEMM on CUDA devices☆98Updated 6 months ago
- Competitive GPU kernel optimization platform.☆93Updated this week
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆377Updated this week
- Dion optimizer algorithm☆193Updated this week
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard!☆71Updated this week
- Accurate, Hardware Accelerated, Special Functions in Mojo 🔥☆35Updated 8 months ago
- Solve puzzles to improve your tinygrad skills!☆141Updated 4 months ago