modular / mojo-gpu-puzzles
Learn GPU Programming in Mojo🔥 by Solving Puzzles
☆249 Updated last week
Alternatives and similar repositories for mojo-gpu-puzzles
Users that are interested in mojo-gpu-puzzles are comparing it to the libraries listed below
- Machine Learning library for the emerging Mojo/Python ecosystem ☆297 Updated 3 weeks ago
- Port of Andrej Karpathy's llm.c to Mojo ☆360 Updated 4 months ago
- A Machine Learning framework from scratch in Pure Mojo 🔥 ☆439 Updated 10 months ago
- Simple MPI implementation for prototyping or learning ☆292 Updated 4 months ago
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs ☆724 Updated 2 weeks ago
- Where GPUs get cooked 👩‍🍳🔥 ☆326 Updated 2 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI. ☆152 Updated 2 years ago
- Competitive GPU kernel optimization platform. ☆141 Updated last week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆196 Updated 6 months ago
- A Learning Journey: Micrograd in Mojo 🔥 ☆63 Updated last year
- Tensor library with autograd using only Rust's standard library ☆70 Updated last year
- SIMD quantization kernels ☆93 Updated 3 months ago
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard! ☆172 Updated last week
- PyTorch Single Controller ☆921 Updated this week
- Quantized LLM training in pure CUDA/C++. ☆221 Updated this week
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆64 Updated last week
- Solve puzzles to improve your tinygrad skills! ☆164 Updated 2 months ago
- A working machine learning framework in pure Mojo 🔥 ☆130 Updated last year
- ☆28 Updated last year
- Implementation of Karpathy's micrograd in Mojo ☆78 Updated 2 years ago
- Tutorials on tinygrad ☆444 Updated 2 months ago
- Learning about CUDA by writing PTX code. ☆149 Updated last year
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code. ☆438 Updated 9 months ago
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s… ☆146 Updated this week
- (WIP) A small but powerful, homemade PyTorch from scratch. ☆661 Updated this week
- Alex Krizhevsky's original code from Google Code ☆197 Updated 9 years ago
- Machine Learning algorithms in pure Mojo 🔥 ☆59 Updated 3 weeks ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4 ☆602 Updated 5 months ago
- Fast and Furious AMD Kernels ☆314 Updated 2 weeks ago
- Dion optimizer algorithm ☆403 Updated this week