☆551Aug 6, 2025Updated 6 months ago
Alternatives and similar repositories for torch-profiling-tutorial
Users that are interested in torch-profiling-tutorial are comparing it to the libraries listed below
Sorting:
- Code for "What really matters in matrix-whitening optimizers?"☆21Oct 31, 2025Updated 4 months ago
- ☆136Dec 9, 2025Updated 2 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Apr 17, 2024Updated last year
- GPU programming related news and material links☆2,010Sep 17, 2025Updated 5 months ago
- ☆470Aug 28, 2025Updated 6 months ago
- NanoGPT (124M) in 2 minutes☆4,679Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆187Jan 19, 2026Updated last month
- Simple Transformer in Jax☆142Jun 22, 2024Updated last year
- Enemies for your LLM☆35Jan 20, 2026Updated last month
- ☆33Dec 10, 2025Updated 2 months ago
- H-Net Dynamic Hierarchical Architecture☆81Sep 11, 2025Updated 5 months ago
- ☆13Nov 27, 2025Updated 3 months ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Aug 15, 2020Updated 5 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Jul 15, 2022Updated 3 years ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Download, parse, and filter data from Literotica. Data-ready for The-Pile.☆11Sep 18, 2020Updated 5 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 2 years ago
- Memory optimized Mixture of Experts☆73Jul 25, 2025Updated 7 months ago
- rl from zero pretrain, can it be done? yes.☆287Sep 28, 2025Updated 5 months ago
- Cute layout visualization☆30Jan 18, 2026Updated last month
- ☆13Jun 3, 2024Updated last year
- Papers about infrastructure (deployment & serving) and systems for compound AI☆12Nov 6, 2024Updated last year
- Apps that run on modal.com☆13Sep 14, 2025Updated 5 months ago
- SMS Client for Twitter DMs☆11Jul 6, 2020Updated 5 years ago
- ☆12Jan 4, 2024Updated 2 years ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆162Oct 19, 2023Updated 2 years ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,099Aug 26, 2025Updated 6 months ago
- A PyTorch native platform for training generative AI models☆5,098Updated this week
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆478Feb 3, 2026Updated last month
- ☆31Dec 8, 2023Updated 2 years ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆253May 6, 2025Updated 9 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated last year
- ☆17Jul 9, 2025Updated 7 months ago
- nanoGPT using Equinox☆15Mar 3, 2023Updated 3 years ago
- ☆13Sep 13, 2023Updated 2 years ago
- Graph model execution API for Candle☆17Jul 27, 2025Updated 7 months ago
- Machine Learning Engineering Open Book☆17,162Feb 21, 2026Updated last week
- [WACV 2025] DistillDIFT: Distillation of Diffusion Features for Semantic Correspondence☆35Jul 10, 2025Updated 7 months ago