stas00 / the-art-of-debugging
The Art of Debugging
☆861 Updated 7 months ago
Alternatives and similar repositories for the-art-of-debugging:
Users who are interested in the-art-of-debugging are comparing it to the repositories listed below
- What would you do with 1000 H100s... ☆1,021 Updated last year
- ☆423 Updated 5 months ago
- Puzzles for exploring transformers ☆335 Updated last year
- GPU programming related news and material links ☆1,421 Updated 2 months ago
- Puzzles for learning Triton ☆1,527 Updated 4 months ago
- An ML Systems Onboarding list ☆734 Updated 2 months ago
- Building blocks for foundation models. ☆467 Updated last year
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆783 Updated 3 weeks ago
- For optimization algorithm research and development. ☆501 Updated this week
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆523 Updated last month
- Slides, notes, and materials for the workshop ☆321 Updated 9 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,746 Updated 3 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆948 Updated 2 weeks ago
- Best practices & guides on how to write distributed pytorch training code ☆373 Updated last month
- Llama from scratch, or How to implement a paper without crying ☆550 Updated 9 months ago
- Textbook on reinforcement learning from human feedback ☆492 Updated this week
- Everything you want to know about Google Cloud TPU ☆520 Updated 8 months ago
- A puzzle to learn about prompting ☆124 Updated last year
- Solve puzzles. Improve your pytorch. ☆3,486 Updated 8 months ago
- Open weights language model from Google DeepMind, based on Griffin. ☆628 Updated last month
- ☆409 Updated 8 months ago
- The full minitorch student suite. ☆2,034 Updated 7 months ago
- Tile primitives for speedy kernels ☆2,170 Updated this week
- Machine Learning with Symbolic Tensors ☆262 Updated 3 weeks ago
- UNet diffusion model in pure CUDA ☆600 Updated 8 months ago
- A deep dive into embeddings starting from fundamentals ☆1,004 Updated 4 months ago
- ☆242 Updated last year
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement… ☆370 Updated this week
- TensorDict is a pytorch dedicated tensor container. ☆899 Updated this week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆557 Updated this week