DaertML / context_distillationLinks
Framework to achieve context distillation in LLMs
☆15Updated 2 years ago
Alternatives and similar repositories for context_distillation
Users that are interested in context_distillation are comparing it to the libraries listed below
Sorting:
- ☆21Updated last year
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆77Updated last year
- ☆21Updated 7 months ago
- Data mapping framework for rust stuff☆44Updated this week
- Open Implementations of LLM Analyses☆107Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated 2 years ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆103Updated last year
- ☆34Updated last year
- Repository for CPU Kernel Generation for LLM Inference☆27Updated 2 years ago
- ☆64Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated 2 years ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- FuseAI Project☆87Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆46Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆46Updated last year
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Updated last month
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆46Updated 5 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- [TMLR 2026] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models☆121Updated 11 months ago
- ☆41Updated last year
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated 2 years ago
- 🚢 Data Toolkit for Sailor Language Models☆95Updated 11 months ago
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- ☆82Updated 2 months ago
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆126Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- a curated list of the role of small models in the LLM era☆111Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆101Updated 2 years ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆81Updated 2 years ago