Best practices for distilling large language models.
☆633Feb 1, 2024Updated 2 years ago
Alternatives and similar repositories for llm_distillation_playbook
Users that are interested in llm_distillation_playbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicit…☆1,293Mar 9, 2025Updated last year
- An Open Source Toolkit For LLM Distillation☆970May 12, 2026Updated last month
- A pipeline for LLM knowledge distillation☆116May 7, 2026Updated last month
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,800May 28, 2026Updated last month
- Easy to use, High Performant Knowledge Distillation for LLMs☆98May 5, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆267Mar 13, 2025Updated last year
- ☆595Sep 7, 2023Updated 2 years ago
- Robust recipes to align language models with human and AI preferences☆5,623May 26, 2026Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,300Jun 22, 2026Updated last week
- Tools for merging pretrained large language models.☆7,190Jun 17, 2026Updated last week
- Go ahead and axolotl questions☆12,082Updated this week
- Codes, scripts, and notebooks on various aspects of transformer models.☆26Feb 27, 2023Updated 3 years ago
- ☆48Feb 1, 2022Updated 4 years ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,415Jun 17, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Building GPT ...☆18Dec 1, 2024Updated last year
- LLM Finetuning with peft☆2,938Aug 1, 2025Updated 10 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,449Updated this week
- ☆16Jun 5, 2023Updated 3 years ago
- DSPy: The framework for programming—not prompting—language models☆35,605Updated this week
- Serving multiple LoRA finetuned LLM as one☆1,163May 8, 2024Updated 2 years ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,914Jan 21, 2024Updated 2 years ago
- Structured Outputs☆14,273Updated this week
- Data and tools for generating and inspecting OLMo pre-training data.☆1,515Nov 5, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyTorch native post-training library☆5,777Updated this week
- This repository contains demos I made with the Transformers library by HuggingFace.☆11,653Apr 20, 2026Updated 2 months ago
- Minimalistic large language model 3D-parallelism training☆2,729May 26, 2026Updated last month
- A framework for few-shot evaluation of language models.☆13,106Updated this week
- A guidance language for controlling large language models.☆21,519May 21, 2026Updated last month
- ☆3,093Jun 16, 2026Updated 2 weeks ago
- Efficient Triton Kernels for LLM Training☆6,456Jun 23, 2026Updated last week
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,751Jun 25, 2024Updated 2 years ago
- Curated list of datasets and tools for post-training.☆4,665Apr 29, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆736Apr 10, 2024Updated 2 years ago
- ☆2,280Updated this week
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,692Mar 8, 2024Updated 2 years ago
- My Solution to Assignments of CS234(Stanford / Fall 2019)☆15Sep 3, 2020Updated 5 years ago
- Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- AllenAI's post-training codebase☆3,775Updated this week
- Chunk your text using gpt4o-mini more accurately☆44Aug 3, 2024Updated last year