LLM checkpointing for DeepSpeed/Megatron
☆25Nov 30, 2025Updated 3 months ago
Alternatives and similar repositories for datastates-llm
Users that are interested in datastates-llm are comparing it to the libraries listed below
Sorting:
- A resilient distributed training framework☆97Apr 11, 2024Updated last year
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".☆47Jul 12, 2024Updated last year
- ☆15Apr 11, 2024Updated last year
- ☆16Apr 7, 2024Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆17Feb 24, 2026Updated last week
- GO GO EXPERIMENTAL LAB☆17Feb 15, 2026Updated 2 weeks ago
- ☆19May 4, 2023Updated 2 years ago
- ☆22Apr 22, 2024Updated last year
- Fast and memory-efficient exact attention☆29Dec 2, 2024Updated last year
- quick playground to animate pippin☆14Nov 11, 2024Updated last year
- ☆34Sep 10, 2024Updated last year
- PyTorch implementation for our ICLR 2024 paper "Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory…☆26Dec 21, 2023Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆43Nov 9, 2023Updated 2 years ago
- ☆11Nov 17, 2015Updated 10 years ago
- an autonomous independent digital companion☆14Feb 12, 2026Updated 2 weeks ago
- Discord Docsbot, Built on bgent☆11Jun 17, 2024Updated last year
- [Developmental] Quarto Extension to Enable Google Colaboratory Links with Quarto Documents☆15May 18, 2025Updated 9 months ago
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 6 months ago
- A speicifically designed KV store for blockchain systems☆11Mar 10, 2025Updated 11 months ago
- Literate Python package development with Jupyter☆12Aug 18, 2025Updated 6 months ago
- An open-source key-value SSD emulator built on top of FEMU. (ASPLOS '25)☆12Mar 31, 2025Updated 11 months ago
- Cookiecutter template for making a cog for Red.☆12Jun 18, 2024Updated last year
- MCP server for Google search and page fetching using headless Chromium☆67Feb 21, 2026Updated last week
- MATLAB code for Stein Point Markov Chain Monte Carlo.☆13Jul 3, 2019Updated 6 years ago
- Extract streaming data from text using prefix completion.☆10Oct 6, 2024Updated last year
- Modern normalizing flows in Python. Simple to use and easily extensible.☆12Feb 11, 2026Updated 3 weeks ago
- Canonical normalizing flows☆10Apr 30, 2019Updated 6 years ago
- ☆134May 29, 2025Updated 9 months ago
- ☆49Jul 17, 2025Updated 7 months ago
- Material for the SC22 Deep Learning at Scale Tutorial☆41Jul 14, 2023Updated 2 years ago
- A project that analyses recent carbon emissions worldwide☆14Mar 26, 2024Updated last year
- ☆16Jan 14, 2025Updated last year
- ☆11Jun 5, 2024Updated last year
- Improving transparency of large language models' reasoning☆14Nov 25, 2025Updated 3 months ago
- A python API for Lattice QCD applications☆10Sep 28, 2020Updated 5 years ago
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models☆11Jan 23, 2024Updated 2 years ago
- SplitBud is a Split Learning framework built upon Flower☆14Mar 22, 2025Updated 11 months ago
- Continuously tempered Hamiltonian Monte Carlo☆12Apr 12, 2017Updated 8 years ago
- An attempt to fix the light-themed webpages!☆12Feb 7, 2018Updated 8 years ago