Tiny re-implementation of MDM in the style of LLaDA and the nano-gpt speedrun
☆58 · Mar 10, 2025 · Updated last year
Alternatives and similar repositories for nano-mdm
Users that are interested in nano-mdm are comparing it to the libraries listed below.
- Focused on fast experimentation and simplicity ☆80 · Dec 24, 2024 · Updated last year
- ☆19 · Dec 31, 2025 · Updated 4 months ago
- ☆27 · May 3, 2024 · Updated 2 years ago
- Minimal Implementation of a D3PM in pytorch ☆296 · Apr 22, 2024 · Updated 2 years ago
- ☆53 · Jan 6, 2024 · Updated 2 years ago
- [Poster; ICLR 2026] [Oral; NeurIPS OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers ☆16 · Apr 15, 2026 · Updated 3 weeks ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 · Apr 17, 2024 · Updated 2 years ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE ☆19 · Sep 22, 2021 · Updated 4 years ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆34 · Feb 27, 2025 · Updated last year
- ☆114 · May 29, 2023 · Updated 2 years ago
- LLaDA implementation ☆19 · Jul 24, 2025 · Updated 9 months ago
- An extension to the GaLore paper, to perform Natural Gradient Descent in low rank subspace ☆19 · Oct 21, 2024 · Updated last year
- Code for the paper "Function-Space Learning Rates" ☆24 · Jun 3, 2025 · Updated 11 months ago
- ☆34 · Sep 10, 2024 · Updated last year