srush / Tensor-Puzzles
Solve puzzles. Improve your pytorch.
☆3,427Updated 7 months ago
Alternatives and similar repositories for Tensor-Puzzles:
Users that are interested in Tensor-Puzzles are comparing it to the libraries listed below
- Puzzles for learning Triton☆1,403Updated 3 months ago
- The full minitorch student suite.☆2,008Updated 6 months ago
- Solve puzzles. Learn CUDA.☆10,514Updated 5 months ago
- What would you do with 1000 H100s...☆1,001Updated last year
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,731Updated 2 months ago
- GPU programming related news and material links☆1,368Updated last month
- Material for gpu-mode lectures☆3,731Updated last week
- Tile primitives for speedy kernels☆2,042Updated this week
- ☆416Updated 4 months ago
- A PyTorch native library for large model training☆3,326Updated this week
- The Art of Debugging☆857Updated 6 months ago
- Schedule-Free Optimization in PyTorch☆2,098Updated 2 months ago
- Puzzles for exploring transformers☆332Updated last year
- Machine Learning Engineering Open Book☆12,821Updated 2 weeks ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,790Updated 2 months ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,319Updated 8 months ago
- UNet diffusion model in pure CUDA☆599Updated 7 months ago
- ☆4,058Updated 8 months ago
- Tensors, for human consumption☆1,183Updated 3 months ago
- JAX - A curated list of resources https://github.com/google/jax☆1,699Updated this week
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,251Updated 2 months ago
- NanoGPT (124M) in 3 minutes☆2,294Updated this week
- Building blocks for foundation models.☆448Updated last year
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆772Updated this week
- PyTorch native post-training library☆4,856Updated this week
- The n-gram Language Model☆1,386Updated 6 months ago
- High Quality Resources on GPU Programming/Architecture☆581Updated 6 months ago
- llama3.np is a pure NumPy implementation for Llama 3 model.☆973Updated 8 months ago
- TensorDict is a pytorch dedicated tensor container.☆879Updated this week
- Llama from scratch, or How to implement a paper without crying☆542Updated 8 months ago