Best practices for distilling large language models.
☆626 · Feb 1, 2024 · Updated 2 years ago
Alternatives and similar repositories for llm_distillation_playbook
Users that are interested in llm_distillation_playbook are comparing it to the libraries listed below.
- This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicit… ☆1,280 · Mar 9, 2025 · Updated last year
- An Open Source Toolkit For LLM Distillation ☆942 · May 12, 2026 · Updated last week
- A pipeline for LLM knowledge distillation ☆113 · May 7, 2026 · Updated 2 weeks ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,781 · May 21, 2025 · Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs ☆97 · May 5, 2025 · Updated last year
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024) ☆262 · Mar 13, 2025 · Updated last year
- ☆592 · Sep 7, 2023 · Updated 2 years ago
- Robust recipes to align language models with human and AI preferences ☆5,602 · Apr 8, 2026 · Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,217 · Apr 27, 2026 · Updated 3 weeks ago
- Tools for merging pretrained large language models. ☆7,083 · May 6, 2026 · Updated 2 weeks ago
- Go ahead and axolotl questions ☆11,938 · Updated this week
- Codes, scripts, and notebooks on various aspects of transformer models. ☆26 · Feb 27, 2023 · Updated 3 years ago
- ☆48 · Feb 1, 2022 · Updated 4 years ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs ☆4,388 · Updated this week
- Building GPT ... ☆18 · Dec 1, 2024 · Updated last year
- LLM Finetuning with peft ☆2,924 · Aug 1, 2025 · Updated 9 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆13,364 · May 1, 2026 · Updated 2 weeks ago
- ☆16 · Jun 5, 2023 · Updated 2 years ago
- DSPy: The framework for programming—not prompting—language models ☆34,496 · Updated this week
- Serving multiple LoRA finetuned LLM as one ☆1,160 · May 8, 2024 · Updated 2 years ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters ☆1,913 · Jan 21, 2024 · Updated 2 years ago
- Structured Outputs ☆13,846 · May 13, 2026 · Updated last week
- Data and tools for generating and inspecting OLMo pre-training data. ☆1,497 · Nov 5, 2025 · Updated 6 months ago
- PyTorch native post-training library ☆5,754 · Updated this week
- This repository contains demos I made with the Transformers library by HuggingFace. ☆11,627 · Apr 20, 2026 · Updated last month
- Minimalistic large language model 3D-parallelism training ☆2,690 · Apr 7, 2026 · Updated last month
- A framework for few-shot evaluation of language models. ☆12,595 · May 11, 2026 · Updated last week
- A guidance language for controlling large language models. ☆21,461 · May 6, 2026 · Updated 2 weeks ago
- ☆3,091 · Nov 21, 2025 · Updated 6 months ago
- Efficient Triton Kernels for LLM Training ☆6,365 · Updated this week
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads ☆2,741 · Jun 25, 2024 · Updated last year
- Curated list of datasets and tools for post-training. ☆4,585 · Apr 29, 2026 · Updated 3 weeks ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆736 · Apr 10, 2024 · Updated 2 years ago
- ☆2,242 · May 11, 2026 · Updated last week
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,679 · Mar 8, 2024 · Updated 2 years ago
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023 ☆12 · Dec 13, 2023 · Updated 2 years ago
- AllenAI's post-training codebase ☆3,726 · Updated this week
- Chunk your text using gpt4o-mini more accurately ☆44 · Aug 3, 2024 · Updated last year
- Machine Learning Engineering Open Book ☆17,948 · Updated this week