YoungDubbyDu / LLM-Agent-OptimizationLinks

This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the list. Any suggestions and PRs are welcome!

☆111

Alternatives and similar repositories for LLM-Agent-Optimization

Users that are interested in LLM-Agent-Optimization are comparing it to the libraries listed below

Sorting:

qiancheng0 / ToolRL
☆241Updated 2 weeks ago
bruno686 / Awesome-Agent-Training
Awesome Agent Training
☆164Updated this week
thinkwee / AgentsMeetRL
An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.
☆154Updated this week
yanweiyue / masrouter
☆67Updated 3 weeks ago
bytarnish / AGILE
☆144Updated 5 months ago
0russwest0 / Awesome-Agent-RL
☆242Updated last month
dongguanting / Tool-Star
Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning
☆155Updated last week
ADaM-BJTU / AutoCoA
AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…
☆116Updated 3 months ago
ADaM-BJTU / OpenRFT
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
☆144Updated 6 months ago
zjunlp / WorfBench
[ICLR 2025] Benchmarking Agentic Workflow Generation
☆97Updated 4 months ago
GAIR-NLP / ToRL
☆220Updated last month
GAIR-NLP / DeepResearcher
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆461Updated 2 months ago
scienceaix / deepresearch
Awesome Deep Research list
☆104Updated last week
WeiminXiong / MPO
MPO: Boosting LLM Agents with Meta Plan Optimization
☆58Updated 3 months ago
jiangxinke / Agentic-RAG-R1
Agentic RAG R1 Framework via Reinforcement Learning
☆215Updated last month
Open-Source-O1 / o1_Reasoning_Patterns_Study
☆102Updated 6 months ago
RyanLiu112 / GenPRM
Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
☆75Updated 2 weeks ago
ReTool-RL / ReTool
☆119Updated last month
0russwest0 / Agent-R1
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
☆573Updated 3 weeks ago
LightChen233 / reasoning-boundary
☆62Updated last week
NumberChiffre / mcts-llm
☆95Updated 6 months ago
LCLM-Horizon / A-Comprehensive-Survey-For-Long-Context-Language-Modeling
A Comprehensive Survey on Long Context Language Modeling
☆152Updated 2 weeks ago
Reason-Wang / ToolGen
[ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"
☆143Updated 2 months ago
Gen-Verse / ScoreFlow
Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"
☆78Updated last month
zjunlp / AutoAct
[ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
☆227Updated 5 months ago
MingyuJ666 / Disentangling-Memory-and-Reasoning
[ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.
☆60Updated last month
bingreeky / MaAS
[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet
☆97Updated 2 weeks ago
Tim-Siu / reft-exp
A research repo for experiments about Reinforcement Finetuning
☆48Updated 2 months ago
Wangmerlyn / MCTS-GSM8k-Demo
This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
☆84Updated 3 months ago
dongguanting / FollowRAG
The demo, code and data of FollowRAG
☆73Updated 2 months ago