Nardien/agent-distillation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Nardien/agent-distillation)

Nardien / agent-distillation

Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"

☆251

Alternatives and similar repositories for agent-distillation

Users that are interested in agent-distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ByungKwanLee / Distill-R1
View on GitHub
Open-source RL Framework with Online Teacher-Student Distillation
☆22Mar 5, 2026Updated 4 months ago
db-Lee / Multi-RM
View on GitHub
☆17Jul 17, 2026Updated last week
seanie12 / ThinkSafe
View on GitHub
☆21May 4, 2026Updated 2 months ago
kaistAI / knowledge-reasoning
View on GitHub
[EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…
☆23Dec 4, 2024Updated last year
microsoft / acon
View on GitHub
Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"
☆98Oct 14, 2025Updated 9 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
songmzhang / DSKDv2
View on GitHub
The official implementation of the paper "A Dual-Space Framework for General Knowledge Distillation of Large Language Models".
☆18Jan 4, 2026Updated 6 months ago
Essential-AI / eai-taxonomy
View on GitHub
☆59Aug 19, 2025Updated 11 months ago
brendanhogan / completion_tree_view
View on GitHub
☆15Apr 26, 2025Updated last year
shangshang-wang / Tina
View on GitHub
[ICLR 2026] Tina: Tiny Reasoning Models via LoRA
☆338Sep 23, 2025Updated 10 months ago
arcee-ai / DistillKit
View on GitHub
An Open Source Toolkit For LLM Distillation
☆992May 12, 2026Updated 2 months ago
aniemerg / smolcc
View on GitHub
A lightweight code assistant with tool-using capabilities built on HuggingFace's smolagents.
☆41Jun 11, 2025Updated last year
OPPO-PersonalAI / Agent_Foundation_Models
View on GitHub
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.
☆580Sep 8, 2025Updated 10 months ago
john-hewitt / implicit-ins
View on GitHub
Codebase for Instruction Following without Instruction Tuning
☆36Sep 24, 2024Updated last year
zwhong714 / Hybrid-Policy-Distillation
View on GitHub
[ICML 2026] Hybrid Policy Distillation (HPD) is a practical distillation framework for reasoning-oriented language models. This repositor…
☆24Apr 24, 2026Updated 3 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
JinheonBaek / OmniRetrieval
View on GitHub
Official Code Repository for OmniRetrieval
☆33Jun 1, 2026Updated last month
NathanGodey / qfilters
View on GitHub
Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)
☆34Mar 7, 2025Updated last year
LHL3341 / MetaLadder
View on GitHub
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)
☆12Apr 18, 2025Updated last year
golololologol / LLM-Distillery
View on GitHub
A pipeline for LLM knowledge distillation
☆116May 7, 2026Updated 2 months ago
StarDewXXX / AdaR1
View on GitHub
The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆24May 6, 2026Updated 2 months ago
McGill-NLP / agent-reward-bench
View on GitHub
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
☆48Aug 7, 2025Updated 11 months ago
EIT-NLP / Distilling-CoT-Reasoning
View on GitHub
[ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".
☆22Feb 26, 2025Updated last year
VainF / Thinkless
View on GitHub
[NeurIPS 2025] Thinkless: LLM Learns When to Think
☆261Sep 26, 2025Updated 10 months ago
OPPO-PersonalAI / OAgents
View on GitHub
Implementation for OAgents: An Empirical Study of Building Effective Agents
☆327Oct 13, 2025Updated 9 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Open-Galapagos / evolution-fine-tuning
View on GitHub
Official code, models, and dataset for "Evolution Fine-Tuning (EFT): Learning to Discover Across 371 Optimization Tasks"
☆25Jun 30, 2026Updated 3 weeks ago
thunlp / NOSA
View on GitHub
The official implementation of NOSA
☆19Jun 11, 2026Updated last month
wutaiqiang / LLM_KD_AKL
View on GitHub
☆22Oct 22, 2024Updated last year
RUCAIBox / RLMEC
View on GitHub
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
☆39Jan 12, 2024Updated 2 years ago
haebin-seong / HarmAug
View on GitHub
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
☆14Mar 6, 2025Updated last year
uservan / ThinkPO
View on GitHub
☆17Aug 1, 2025Updated 11 months ago
CoopReason / TESSY
View on GitHub
A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data
☆35May 1, 2026Updated 2 months ago
Sein-Kim / self_evolverec
View on GitHub
☆19Updated this week
thu-coai / Backdoor-Data-Extraction
View on GitHub
☆33May 22, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jej127 / Trace
View on GitHub
☆17Dec 23, 2025Updated 7 months ago
jej127 / KOPL
View on GitHub
Korean Oov Processing System
☆17Dec 13, 2024Updated last year
OSU-NLP-Group / Mind2Web-2
View on GitHub
[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge
☆112May 17, 2026Updated 2 months ago
abdelfattah-lab / TokenButler
View on GitHub
☆27May 12, 2026Updated 2 months ago
Dozi01 / MetaSPO
View on GitHub
☆83Oct 1, 2025Updated 9 months ago
tpoisonooo / open-r1
View on GitHub
Fully open reproduction of DeepSeek-R1
☆11Mar 24, 2025Updated last year
VILA-Lab / DRAG
View on GitHub
(ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…
☆35Aug 23, 2025Updated 11 months ago