modular / mojo-gpu-puzzlesLinks
Learn GPU Programming in Mojo🔥 by Solving Puzzles
☆187Updated last week
Alternatives and similar repositories for mojo-gpu-puzzles
Users that are interested in mojo-gpu-puzzles are comparing it to the libraries listed below
Sorting:
- Machine Learning library for the emerging Mojo/Python ecosystem☆285Updated last week
- port of Andrjey Karpathy's llm.c to Mojo☆358Updated 2 months ago
- A Machine Learning framework from scratch in Pure Mojo 🔥☆440Updated 9 months ago
- Learning about CUDA by writing PTX code.☆145Updated last year
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆146Updated 2 years ago
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆666Updated last week
- ☆28Updated last year
- Simple MPI implementation for prototyping or learning☆286Updated 2 months ago
- A working machine learning framework in pure Mojo 🔥☆130Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆58Updated 2 weeks ago
- Tensor library with autograd using only Rust's standard library☆70Updated last year
- Quantized LLM training in pure CUDA/C++.☆209Updated this week
- A Learning Journey: Micrograd in Mojo 🔥☆63Updated last year
- Solve puzzles to improve your tinygrad skills!☆145Updated 2 weeks ago
- The Tensor (or Array)☆451Updated last year
- High-Performance SGEMM on CUDA devices☆107Updated 9 months ago
- Competitive GPU kernel optimization platform.☆113Updated this week
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆137Updated last month
- Machine Learning algorithms in pure Mojo 🔥☆55Updated last week
- PyTorch Single Controller☆840Updated this week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆305Updated this week
- Learn CUDA with PyTorch☆95Updated last month
- SIMD quantization kernels☆89Updated last month
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆194Updated 5 months ago
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard!☆98Updated 2 weeks ago
- 📖 Learn some mojo !☆135Updated last year
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆192Updated 2 years ago
- Accurate, Hardware Accelerated, Special Functions in Mojo 🔥☆35Updated 10 months ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆560Updated 4 months ago
- ☆52Updated 2 months ago