predibase / llm_distillation_playbookView external linksLinks
Best practices for distilling large language models.
☆604Feb 1, 2024Updated 2 years ago
Alternatives and similar repositories for llm_distillation_playbook
Users that are interested in llm_distillation_playbook are comparing it to the libraries listed below
Sorting:
- This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicit…☆1,252Mar 9, 2025Updated 11 months ago
- An Open Source Toolkit For LLM Distillation☆860Dec 21, 2025Updated last month
- A pipeline for LLM knowledge distillation☆112Apr 2, 2025Updated 10 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,718May 21, 2025Updated 8 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆97May 5, 2025Updated 9 months ago
- Robust recipes to align language models with human and AI preferences☆5,495Sep 8, 2025Updated 5 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆250Mar 13, 2025Updated 11 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,084Jan 26, 2026Updated 2 weeks ago
- ☆580Sep 7, 2023Updated 2 years ago
- Tools for merging pretrained large language models.☆6,783Jan 26, 2026Updated 2 weeks ago
- Go ahead and axolotl questions☆11,289Updated this week
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,284Dec 22, 2025Updated last month
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,897Jan 21, 2024Updated 2 years ago
- Serving multiple LoRA finetuned LLM as one☆1,139May 8, 2024Updated last year
- LLM Finetuning with peft☆2,767Aug 1, 2025Updated 6 months ago
- Minimalistic large language model 3D-parallelism training☆2,544Dec 11, 2025Updated 2 months ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 3 years ago
- Building GPT ...☆18Dec 1, 2024Updated last year
- Codes, scripts, and notebooks on various aspects of transformer models.☆27Feb 27, 2023Updated 2 years ago
- PyTorch native post-training library☆5,669Updated this week
- Structured Outputs☆13,403Feb 6, 2026Updated last week
- DSPy: The framework for programming—not prompting—language models☆32,156Updated this week
- A framework for few-shot evaluation of language models.☆11,393Updated this week
- Data and tools for generating and inspecting OLMo pre-training data.☆1,404Nov 5, 2025Updated 3 months ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,852May 17, 2025Updated 8 months ago
- Automatically evaluate your LLMs in Google Colab☆685May 7, 2024Updated last year
- This repository contains the implementation of evaluation metrics for recommendation systems. We have compared similarity, candidate gene…☆27Feb 21, 2025Updated 11 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆26Updated this week
- This repository contains demos I made with the Transformers library by HuggingFace.☆11,494Jan 13, 2026Updated last month
- ☆3,069Nov 21, 2025Updated 2 months ago
- Efficient Triton Kernels for LLM Training☆6,141Updated this week
- A guidance language for controlling large language models.☆21,270Feb 6, 2026Updated last week
- Machine Learning Engineering Open Book☆16,675Updated this week
- AllenAI's post-training codebase☆3,573Updated this week
- Curated list of datasets and tools for post-training.☆4,242Nov 10, 2025Updated 3 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,885Updated this week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,875Jan 9, 2026Updated last month
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,705Jun 25, 2024Updated last year