stas00 / the-art-of-debugging
The Art of Debugging
☆841Updated 5 months ago
Alternatives and similar repositories for the-art-of-debugging:
Users that are interested in the-art-of-debugging are comparing it to the libraries listed below
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆758Updated last week
- What would you do with 1000 H100s...☆970Updated last year
- ☆413Updated 3 months ago
- Building blocks for foundation models.☆440Updated last year
- GPU programming related news and material links☆1,347Updated 3 weeks ago
- Puzzles for exploring transformers☆331Updated last year
- Puzzles for learning Triton☆1,337Updated 2 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆670Updated this week
- An ML Systems Onboarding list☆664Updated this week
- Best practices & guides on how to write distributed pytorch training code☆342Updated this week
- ☆237Updated 10 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,722Updated last month
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a…☆1,268Updated this week
- For optimization algorithm research and development.☆486Updated last week
- UNet diffusion model in pure CUDA☆596Updated 7 months ago
- Slides, notes, and materials for the workshop☆310Updated 7 months ago
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆379Updated last week
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆341Updated 6 months ago
- The full minitorch student suite.☆1,992Updated 5 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆511Updated this week
- The Tensor (or Array)☆420Updated 5 months ago
- High Quality Resources on GPU Programming/Architecture☆578Updated 6 months ago
- LLM papers I'm reading, mostly on inference and model compression☆707Updated last year
- Tile primitives for speedy kernels☆1,966Updated this week
- ☆203Updated 6 months ago
- Notes from the Latent Space paper club. Follow along or start your own!☆221Updated 5 months ago
- A pure NumPy implementation of Mamba.☆219Updated 6 months ago
- The Multilayer Perceptron Language Model☆533Updated 5 months ago
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,242Updated last month
- TensorDict is a pytorch dedicated tensor container.☆868Updated this week