srush / GPU-Puzzles
Solve puzzles. Learn CUDA.
☆10,751Updated 6 months ago
Alternatives and similar repositories for GPU-Puzzles:
Users that are interested in GPU-Puzzles are comparing it to the libraries listed below
- Solve puzzles. Improve your pytorch.☆3,486Updated 8 months ago
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API☆11,453Updated 7 months ago
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆28,404Updated this week
- The full minitorch student suite.☆2,034Updated 7 months ago
- Material for gpu-mode lectures☆4,075Updated last month
- Inference Llama 2 in one file of pure C☆18,196Updated 7 months ago
- LLM training in simple, raw C/CUDA☆26,079Updated 5 months ago
- A Python framework for high performance GPU simulation and graphics☆4,773Updated this week
- Fast and memory-efficient exact attention☆16,462Updated this week
- An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.☆5,303Updated last week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,502Updated 8 months ago
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆31,708Updated this week
- Flax is a neural network library for JAX that is designed for flexibility.☆6,433Updated this week
- Puzzles for learning Triton☆1,527Updated 4 months ago
- GPU programming related news and material links☆1,421Updated 2 months ago
- NanoGPT (124M) in 3 minutes☆2,417Updated last week
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆21,632Updated 7 months ago
- llama3 implementation one matrix multiplication at a time☆14,285Updated 10 months ago
- Tensor library for machine learning☆12,165Updated last week
- An autoregressive character-level language model for making more things☆2,948Updated 9 months ago
- "Probabilistic Machine Learning" - a book series by Kevin Murphy☆5,137Updated 4 months ago
- Kolmogorov Arnold Networks☆15,526Updated 2 months ago
- The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.☆9,661Updated this week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆37,012Updated this week
- A community-maintained Python framework for creating mathematical animations.☆30,763Updated last week
- A lightweight library for portable low-level GPU computation using WebGPU.☆3,848Updated 2 weeks ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆8,805Updated last month
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,620Updated 3 months ago
- Understanding Deep Learning - Simon J.D. Prince☆7,287Updated 2 weeks ago
- Sioyek is a PDF viewer with a focus on textbooks and research papers☆7,712Updated this week