Best practices for distilling large language models.
☆622 · Feb 1, 2024 · Updated 2 years ago
Alternatives and similar repositories for llm_distillation_playbook
Users that are interested in llm_distillation_playbook are comparing it to the libraries listed below.
- This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicit… ☆1,279 · Mar 9, 2025 · Updated last year
- An Open Source Toolkit For LLM Distillation ☆931 · Mar 14, 2026 · Updated last month
- A pipeline for LLM knowledge distillation ☆113 · Apr 16, 2026 · Updated 2 weeks ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,764 · May 21, 2025 · Updated 11 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs ☆98 · May 5, 2025 · Updated 11 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024) ☆258 · Mar 13, 2025 · Updated last year
- ☆589 · Sep 7, 2023 · Updated 2 years ago
- Robust recipes to align language models with human and AI preferences ☆5,587 · Apr 8, 2026 · Updated 3 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,189 · Apr 20, 2026 · Updated last week
- Tools for merging pretrained large language models. ☆7,023 · Mar 15, 2026 · Updated last month
- Go ahead and axolotl questions ☆11,779 · Updated this week
- ☆48 · Feb 1, 2022 · Updated 4 years ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs ☆4,362 · Updated this week
- Building GPT ... ☆18 · Dec 1, 2024 · Updated last year
- LLM Finetuning with peft ☆2,918 · Aug 1, 2025 · Updated 9 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆13,326 · Updated this week
- ☆16 · Jun 5, 2023 · Updated 2 years ago
- DSPy: The framework for programming—not prompting—language models ☆34,016 · Apr 24, 2026 · Updated last week
- Serving multiple LoRA finetuned LLM as one ☆1,155 · May 8, 2024 · Updated last year
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters ☆1,909 · Jan 21, 2024 · Updated 2 years ago
- Structured Outputs ☆13,741 · Apr 16, 2026 · Updated 2 weeks ago
- ☆27 · Aug 5, 2024 · Updated last year
- Data and tools for generating and inspecting OLMo pre-training data. ☆1,487 · Nov 5, 2025 · Updated 5 months ago
- PyTorch native post-training library ☆5,739 · Apr 24, 2026 · Updated last week
- This repository contains demos I made with the Transformers library by HuggingFace. ☆11,592 · Apr 20, 2026 · Updated last week
- Minimalistic large language model 3D-parallelism training ☆2,663 · Apr 7, 2026 · Updated 3 weeks ago
- A framework for few-shot evaluation of language models. ☆12,331 · Apr 22, 2026 · Updated last week
- A guidance language for controlling large language models. ☆21,408 · Apr 10, 2026 · Updated 3 weeks ago
- ☆3,091 · Nov 21, 2025 · Updated 5 months ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads ☆2,727 · Jun 25, 2024 · Updated last year
- Curated list of datasets and tools for post-training. ☆4,500 · Updated this week
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆736 · Apr 10, 2024 · Updated 2 years ago
- Efficient Triton Kernels for LLM Training ☆6,315 · Updated this week
- ☆2,228 · Apr 17, 2026 · Updated 2 weeks ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,677 · Mar 8, 2024 · Updated 2 years ago
- My Solution to Assignments of CS234 (Stanford / Fall 2019) ☆15 · Sep 3, 2020 · Updated 5 years ago
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023 ☆12 · Dec 13, 2023 · Updated 2 years ago
- AllenAI's post-training codebase ☆3,702 · Updated this week
- Chunk your text using gpt4o-mini more accurately ☆44 · Aug 3, 2024 · Updated last year