YoungDubbyDu / LLM-Agent-Optimization
This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the list. Any suggestions and PRs are welcome!
☆93Updated 3 weeks ago
Alternatives and similar repositories for LLM-Agent-Optimization
Users that are interested in LLM-Agent-Optimization are comparing it to the libraries listed below
Sorting:
- ☆132Updated 2 weeks ago
- ☆55Updated 2 months ago
- ☆133Updated 3 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆103Updated last month
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆345Updated last month
- ☆173Updated last month
- Agentic RAG R1 Framework via Reinforcement Learning☆148Updated this week
- Awesome Agent Training☆106Updated this week
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆135Updated 4 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆92Updated 2 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization☆51Updated 2 months ago
- ☆153Updated last month
- ☆51Updated 8 months ago
- ☆151Updated 2 weeks ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆458Updated this week
- ☆102Updated 5 months ago
- ☆55Updated 7 months ago
- The demo, code and data of FollowRAG☆72Updated 3 weeks ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 5 months ago
- ☆163Updated last week
- ☆42Updated 2 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆76Updated last month
- Knowledge-Reasoning Synergy Reinforcement Learning.☆35Updated 2 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆289Updated last month
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆509Updated 3 weeks ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆41Updated 3 weeks ago
- A Survey on Multimodal Retrieval-Augmented Generation☆165Updated 3 weeks ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆113Updated last week
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆72Updated 3 weeks ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆168Updated 3 weeks ago