stas00 / the-art-of-debugging
The Art of Debugging
☆875Updated 8 months ago
Alternatives and similar repositories for the-art-of-debugging:
Users that are interested in the-art-of-debugging are comparing it to the libraries listed below
- What would you do with 1000 H100s...☆1,038Updated last year
- ☆428Updated 6 months ago
- Puzzles for exploring transformers☆343Updated last year
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆786Updated last month
- Building blocks for foundation models.☆482Updated last year
- GPU programming related news and material links☆1,461Updated 3 months ago
- Puzzles for learning Triton☆1,591Updated 5 months ago
- An ML Systems Onboarding list☆756Updated 3 months ago
- Best practices & guides on how to write distributed pytorch training code☆401Updated 2 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆534Updated this week
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,759Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆991Updated last month
- ☆424Updated 9 months ago
- Thunder gives you PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and …☆1,328Updated this week
- Machine Learning with Symbolic Tensors☆267Updated last month
- UNet diffusion model in pure CUDA☆602Updated 9 months ago
- Slides, notes, and materials for the workshop☆324Updated 10 months ago
- TensorDict is a pytorch dedicated tensor container.☆911Updated this week
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆407Updated this week
- Llama from scratch, or How to implement a paper without crying☆558Updated 10 months ago
- Tile primitives for speedy kernels☆2,279Updated this week
- (WIP) A small but powerful, homemade PyTorch from scratch.☆543Updated last week
- 🤖 A PyTorch library of curated Transformer models and their composable components☆884Updated last year
- For optimization algorithm research and development.☆508Updated this week
- The full minitorch student suite.☆2,055Updated 8 months ago
- Annotated version of the Mamba paper☆483Updated last year
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,184Updated last week
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,252Updated 4 months ago
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…☆286Updated 4 months ago
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.☆576Updated last month