Aegis1863 / LLMs-Distillation-Quantification
Repo of "Quantification of Large Language Model Distillation"
☆78 · Updated last month

Alternatives and similar repositories for LLMs-Distillation-Quantification:
Users interested in LLMs-Distillation-Quantification are comparing it to the repositories listed below.
- Knowledge-Reasoning Synergy Reinforcement Learning ☆34 · Updated last month
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details ☆174 · Updated last week
- A repo showcasing the use of MCTS with LLMs to solve GSM8K problems ☆74 · Updated last month
- ☆41 · Updated this week
- ☆74 · Updated 3 weeks ago
- An Open Math Pre-training Dataset with 370B Tokens ☆72 · Updated 3 weeks ago
- From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation ☆88 · Updated last month
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning ☆152 · Updated last week
- ☆101 · Updated 4 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆79 · Updated 2 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization" ☆64 · Updated 2 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆132 · Updated 10 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆86 · Updated last month
- A Comprehensive Survey on Long Context Language Modeling ☆131 · Updated 3 weeks ago
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks" ☆182 · Updated 2 weeks ago
- ☆130 · Updated 3 months ago
- ☆93 · Updated 4 months ago
- ☆57 · Updated last month
- Scaling Deep Research via Reinforcement Learning in Real-world Environments ☆282 · Updated last week
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling ☆101 · Updated 3 months ago
- Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs ☆42 · Updated 10 months ago
- Official repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale" ☆236 · Updated last week
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior ☆229 · Updated last week
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models ☆158 · Updated last month
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models ☆101 · Updated 3 weeks ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning (COLM 2024) ☆32 · Updated 10 months ago
- ☆52 · Updated 2 months ago
- ☆153 · Updated 3 weeks ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models ☆117 · Updated last week
- ☆40 · Updated last month