YoungDubbyDu / LLM-Agent-Optimization
This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the list. Any suggestions and PRs are welcome!
☆84Updated last week
Alternatives and similar repositories for LLM-Agent-Optimization:
Users that are interested in LLM-Agent-Optimization are comparing it to the libraries listed below
- ☆153Updated 3 weeks ago
- Awesome Agent Training☆33Updated this week
- ☆126Updated 3 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆244Updated last week
- ☆52Updated 2 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆101Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆79Updated 2 months ago
- ☆405Updated this week
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆72Updated last month
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆133Updated 4 months ago
- Knowledge-Reasoning Synergy Reinforcement Learning.☆34Updated last month
- ☆51Updated 7 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆241Updated last week
- MPO: Boosting LLM Agents with Meta Plan Optimization☆50Updated last month
- ☆157Updated 3 weeks ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆27Updated last week
- ☆101Updated 4 months ago
- ☆93Updated 4 months ago
- ☆55Updated 6 months ago
- ☆125Updated 3 weeks ago
- 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability☆147Updated 2 weeks ago
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization☆126Updated 3 months ago
- A research repo for experiments about Reinforcement Finetuning☆44Updated 2 weeks ago
- ☆40Updated last month
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 4 months ago
- A comprehensive collection of process reward models.☆67Updated this week
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆107Updated this week
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆477Updated this week
- connecting humans and agents☆82Updated 4 months ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆47Updated 2 months ago