Aegis1863 / LLMs-Distillation-Quantification
Repo of ACL 2025 main Paper "Quantification of Large Language Model Distillation"
☆88Updated last month
Alternatives and similar repositories for LLMs-Distillation-Quantification
Users that are interested in LLMs-Distillation-Quantification are comparing it to the libraries listed below
- ☆89Updated last month
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆95Updated last month
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆60Updated last month
- ☆102Updated 7 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆174Updated 3 weeks ago
- [COLM 2025] An Open Math Pre-training Dataset with 370B Tokens.☆95Updated 3 months ago
- ☆82Updated last year
- ☆70Updated 4 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- ☆59Updated 3 weeks ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆103Updated last month
- ☆94Updated 7 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆45Updated last week
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆195Updated last week
- Efficient Agent Training for Computer Use☆111Updated last month
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆76Updated last week
- ☆154Updated 2 months ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆110Updated last month
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆227Updated last month
- ☆47Updated last month
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆52Updated last month
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆251Updated this week
- A Comprehensive Library for Memory of LLM-based Agents.☆47Updated last month
- Official Implementation of "Reasoning Language Models: A Blueprint"☆69Updated 3 weeks ago
- ☆95Updated 6 months ago
- ☆70Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆79Updated last month
- ☆277Updated last month
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆57Updated 9 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆125Updated 2 weeks ago