modular / mojo-gpu-puzzles
Learn GPU Programming in Mojo🔥 by Solving Puzzles
☆284 Updated this week
Alternatives and similar repositories for mojo-gpu-puzzles
Users interested in mojo-gpu-puzzles are comparing it to the libraries listed below.
- Machine Learning library for the emerging Mojo/Python ecosystem ☆316 Updated last week
- Competitive GPU kernel optimization platform. ☆146 Updated last week
- Port of Andrej Karpathy's llm.c to Mojo ☆362 Updated 5 months ago
- A Machine Learning framework from scratch in Pure Mojo 🔥 ☆441 Updated last year
- A Learning Journey: Micrograd in Mojo 🔥 ☆65 Updated last year
- Simple MPI implementation for prototyping or learning ☆299 Updated 5 months ago
- A working machine learning framework in pure Mojo 🔥 ☆129 Updated last year
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI. ☆155 Updated 2 years ago
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs ☆818 Updated last week
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆68 Updated this week
- SIMD quantization kernels ☆94 Updated 4 months ago
- Quantized LLM training in pure CUDA/C++. ☆233 Updated last week
- ☆29 Updated last year
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard! ☆194 Updated this week
- Learning about CUDA by writing PTX code. ☆151 Updated last year
- PyTorch Single Controller ☆953 Updated this week
- Solve puzzles to improve your tinygrad skills! ☆177 Updated 3 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆195 Updated 7 months ago
- Where GPUs get cooked 👩‍🍳🔥 ☆357 Updated last week
- Machine Learning algorithms in pure Mojo 🔥 ☆62 Updated last week
- NuMojo is a library for numerical computing in Mojo 🔥 similar to numpy in Python. ☆199 Updated this week
- Alex Krizhevsky's original code from Google Code ☆199 Updated 9 years ago
- A curriculum for learning about gpu performance engineering, from scratch to what the frontier AI labs do ☆301 Updated 2 weeks ago
- Tensor library with autograd using only Rust's standard library ☆71 Updated last year
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code. ☆457 Updated 10 months ago
- Implementation of Karpathy's micrograd in Mojo ☆77 Updated 2 years ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism. ☆114 Updated last month
- (WIP) A small but powerful, homemade PyTorch from scratch. ☆672 Updated this week
- An implementation of the transformer architecture onto an Nvidia CUDA kernel ☆202 Updated 2 years ago
- Tutorials on tinygrad ☆453 Updated 3 months ago