jongwooko/distillm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jongwooko/distillm)

jongwooko / distillm

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

☆266

Alternatives and similar repositories for distillm

Users that are interested in distillm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

songmzhang / DSKD
View on GitHub
Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…
☆63Mar 21, 2026Updated 4 months ago
Tebmer / Awesome-Knowledge-Distillation-of-LLMs
View on GitHub
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicit…
☆1,294Mar 9, 2025Updated last year
jongwooko / distillm-2
View on GitHub
Official PyTorch implementation of DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs (ICML 2025 Oral)
☆71Jun 27, 2025Updated last year
Nicolas-BZRD / llm-recipes
View on GitHub
☆33Mar 13, 2024Updated 2 years ago
wutaiqiang / LLM_KD_AKL
View on GitHub
☆22Oct 22, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
arcee-ai / DistillKit
View on GitHub
An Open Source Toolkit For LLM Distillation
☆986May 12, 2026Updated 2 months ago
golololologol / LLM-Distillery
View on GitHub
A pipeline for LLM knowledge distillation
☆116May 7, 2026Updated 2 months ago
jongwooko / Pytorch-MiniLM
View on GitHub
Unofficial Pytorch implementation of MiniLM and MiniLMv2
☆23Jan 30, 2022Updated 4 years ago
jongwooko / NASH-Pruning-Official
View on GitHub
Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …
☆17Oct 17, 2023Updated 2 years ago
raymin0223 / fast_robust_early_exit
View on GitHub
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
☆67Sep 28, 2024Updated last year
kimyuji / EvolvingQA_benchmark
View on GitHub
Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)
☆10Oct 16, 2024Updated last year
raymin0223 / self-contrastive-learning
View on GitHub
Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network (AAAI 2023)
☆21Oct 28, 2023Updated 2 years ago
microsoft / LMOps
View on GitHub
General technology for enabling AI capabilities w/ LLMs and MLLMs
☆4,438Jun 17, 2026Updated last month
AIoT-MLSys-Lab / SVD-LLM
View on GitHub
[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2
☆301Aug 28, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ghwang-s / abkd
View on GitHub
ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence
☆46Aug 8, 2025Updated 11 months ago
Nicolas-BZRD / llm-distillation
View on GitHub
☆11Feb 3, 2025Updated last year
jxiw / MambaInLlama
View on GitHub
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
☆242Oct 14, 2025Updated 9 months ago
princeton-nlp / LLM-Shearing
View on GitHub
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆640Mar 4, 2024Updated 2 years ago
cliang1453 / task-aware-distillation
View on GitHub
Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)
☆40Aug 28, 2023Updated 2 years ago
hkust-nlp / deita
View on GitHub
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆599Dec 9, 2024Updated last year
horseee / Awesome-Efficient-LLM
View on GitHub
A curated list for Efficient Large Language Models
☆2,023Jun 17, 2025Updated last year
sungnyun / cav2vec
View on GitHub
(ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
☆16Apr 29, 2025Updated last year
schauppi / Self-Rewarding-Language-Models
View on GitHub
☆50May 13, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
thu-coai / MiniPLM
View on GitHub
[ICLR 2025] MiniPLM: Knowledge Distillation for Pre-Training Language Models
☆79Nov 23, 2024Updated last year
predibase / llm_distillation_playbook
View on GitHub
Best practices for distilling large language models.
☆636Feb 1, 2024Updated 2 years ago
google-research / distilling-step-by-step
View on GitHub
☆597Sep 7, 2023Updated 2 years ago
horseee / LLM-Pruner
View on GitHub
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baich…
☆1,130Oct 7, 2024Updated last year
MANGA-UOFA / fdistill
View on GitHub
☆22Feb 4, 2026Updated 5 months ago
RUCAIBox / RLMEC
View on GitHub
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
☆39Jan 12, 2024Updated 2 years ago
sramshetty / ShortGPT
View on GitHub
Unofficial implementations of block/layer-wise pruning methods for LLMs.
☆78Apr 29, 2024Updated 2 years ago
lin-honghui / data-competition-calendar
View on GitHub
国内外数据竞赛资讯整理
☆18Nov 6, 2021Updated 4 years ago
llyx97 / Rosita
View on GitHub
[AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan
☆14Oct 18, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
BM-K / KoDiffCSE
View on GitHub
Difference-based Contrastive Learning for Korean Sentence Embeddings
☆23Mar 11, 2026Updated 4 months ago
NVlabs / Minitron
View on GitHub
A family of compressed models obtained via pruning and knowledge distillation
☆383Nov 6, 2025Updated 8 months ago
linusericsson / ssl-invariances
View on GitHub
Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks".
☆16Dec 7, 2021Updated 4 years ago
locuslab / wanda
View on GitHub
A simple and effective LLM pruning approach.
☆868Aug 9, 2024Updated last year
microsoft / only_train_once
View on GitHub
OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM
☆51Oct 10, 2024Updated last year
facebookresearch / LLM-QAT
View on GitHub
Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"
☆325Mar 4, 2025Updated last year
microsoft / rho
View on GitHub
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆470Apr 18, 2024Updated 2 years ago