predibase / llm_distillation_playbook
Best practices for distilling large language models.
☆531 · Updated last year
Alternatives and similar repositories for llm_distillation_playbook
Users interested in llm_distillation_playbook are comparing it to the libraries listed below.
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,516 · Updated last week
- ☆515 · Updated 5 months ago
- LLM Workshop by Sourab Mangrulkar ☆380 · Updated 11 months ago
- An Open Source Toolkit For LLM Distillation ☆596 · Updated 2 weeks ago
- Official repository for ORPO ☆452 · Updated 11 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆840 · Updated last week
- Recipes to scale inference-time compute of open models ☆1,071 · Updated last week
- ☆1,182 · Updated 2 months ago
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,466 · Updated 3 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ☆699 · Updated last month
- Generative Representational Instruction Tuning ☆628 · Updated 2 months ago
- awesome synthetic (text) datasets ☆281 · Updated 6 months ago
- List of papers on hallucination detection in LLMs. ☆862 · Updated last week
- Automatically evaluate your LLMs in Google Colab ☆622 · Updated last year
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning ☆353 · Updated 8 months ago
- System 2 Reasoning Link Collection ☆833 · Updated 2 months ago
- ☆1,019 · Updated 4 months ago
- A reading list on LLM based Synthetic Data Generation 🔥 ☆1,265 · Updated 2 months ago
- Minimalistic large language model 3D-parallelism training ☆1,870 · Updated this week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆791 · Updated 2 weeks ago
- Easily embed, cluster and semantically label text datasets ☆534 · Updated last year
- A bibliography and survey of the papers surrounding o1 ☆1,192 · Updated 6 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,700 · Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯 ☆938 · Updated 3 weeks ago
- Data and tools for generating and inspecting OLMo pre-training data. ☆1,209 · Updated 3 weeks ago
- Automatic evals for LLMs ☆388 · Updated this week
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,526 · Updated last year
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning ☆608 · Updated last year
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,484 · Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,377 · Updated last week