Best practices for distilling large language models.
☆628Feb 1, 2024Updated 2 years ago
Alternatives and similar repositories for llm_distillation_playbook
Users that are interested in llm_distillation_playbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicit…☆1,286Mar 9, 2025Updated last year
- An Open Source Toolkit For LLM Distillation☆959May 12, 2026Updated 3 weeks ago
- A pipeline for LLM knowledge distillation☆115May 7, 2026Updated last month
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,790May 28, 2026Updated last week
- Easy to use, High Performant Knowledge Distillation for LLMs☆98May 5, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆264Mar 13, 2025Updated last year
- ☆593Sep 7, 2023Updated 2 years ago
- Robust recipes to align language models with human and AI preferences☆5,608May 26, 2026Updated 2 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,240Jun 1, 2026Updated last week
- Tools for merging pretrained large language models.☆7,126May 6, 2026Updated last month
- Go ahead and axolotl questions☆12,001Updated this week
- Codes, scripts, and notebooks on various aspects of transformer models.☆26Feb 27, 2023Updated 3 years ago
- ☆48Feb 1, 2022Updated 4 years ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,403Jun 4, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Ludwig benchmark☆20May 11, 2026Updated last month
- Building GPT ...☆18Dec 1, 2024Updated last year
- LLM Finetuning with peft☆2,929Aug 1, 2025Updated 10 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,414Updated this week
- ☆16Jun 5, 2023Updated 3 years ago
- DSPy: The framework for programming—not prompting—language models☆34,958Updated this week
- Serving multiple LoRA finetuned LLM as one☆1,159May 8, 2024Updated 2 years ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,914Jan 21, 2024Updated 2 years ago
- Structured Outputs☆13,947May 18, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆29Aug 5, 2024Updated last year
- Data and tools for generating and inspecting OLMo pre-training data.☆1,508Nov 5, 2025Updated 7 months ago
- PyTorch native post-training library☆5,768Updated this week
- This repository contains demos I made with the Transformers library by HuggingFace.☆11,635Apr 20, 2026Updated last month
- Minimalistic large language model 3D-parallelism training☆2,711May 26, 2026Updated 2 weeks ago
- A framework for few-shot evaluation of language models.☆12,885Jun 2, 2026Updated last week
- A guidance language for controlling large language models.☆21,488May 21, 2026Updated 2 weeks ago
- ☆3,094Nov 21, 2025Updated 6 months ago
- Efficient Triton Kernels for LLM Training☆6,415Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,748Jun 25, 2024Updated last year
- Curated list of datasets and tools for post-training.☆4,628Apr 29, 2026Updated last month
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆736Apr 10, 2024Updated 2 years ago
- ☆2,262Updated this week
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,684Mar 8, 2024Updated 2 years ago
- My Solution to Assignments of CS234(Stanford / Fall 2019)☆15Sep 3, 2020Updated 5 years ago
- Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago