This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the list. Any suggestions and PRs are welcome!
☆238Jun 17, 2026Updated 2 weeks ago
Alternatives and similar repositories for Awesome-LLM-Agent-Optimization-Papers
Users that are interested in Awesome-LLM-Agent-Optimization-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 这是对基于大模型的多智能体系统论文的总结☆10Jun 23, 2024Updated 2 years ago
- ☆84May 14, 2026Updated last month
- ☆21Jun 9, 2025Updated last year
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- ☆15Oct 28, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An efficient hierarchical Graph-based RAG☆41Nov 27, 2025Updated 7 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆806May 30, 2026Updated last month
- An RL-Friendly Vision-Language Model for Minecraft☆41Oct 17, 2024Updated last year
- Automatic prompt optimization framework for multi-step agent tasks.☆37Nov 12, 2024Updated last year
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ) 🍓☆36Apr 3, 2025Updated last year
- A repo lists papers related to LLM based agent☆2,324Jul 12, 2025Updated 11 months ago
- ☆19Sep 19, 2024Updated last year
- Survey on LLM Agents (Published on CoLing 2025)☆508Oct 3, 2025Updated 9 months ago
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆60Jul 24, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling finan…☆77Jun 23, 2025Updated last year
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆157Dec 24, 2024Updated last year
- Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"☆66Dec 4, 2025Updated 7 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆153Feb 19, 2025Updated last year
- ☆28May 30, 2026Updated last month
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆82Jan 18, 2024Updated 2 years ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆32Mar 6, 2026Updated 3 months ago
- AI, especially Deep Learning, has made breakthroughs in learning from Brain Signals, vital for both Brain Encoding and Decoding. Unlock t…☆17Sep 17, 2025Updated 9 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆770May 10, 2026Updated last month
- ☆166Jan 21, 2025Updated last year
- ☆54Sep 6, 2025Updated 9 months ago
- Repo for "AlphaResearch: Accelerating New Algorithm Discovery with Language Models"☆57Nov 12, 2025Updated 7 months ago
- ☆23May 21, 2025Updated last year
- ☆25Jul 20, 2025Updated 11 months ago
- [ACM MM 24] The implementation of paper Low-rank Prompt Interaction for Continual Vision-language Retrieval☆17Nov 20, 2024Updated last year
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 10 months ago
- [EMNLP 2023 (Findings)] Schema-adaptable Knowledge Graph Construction☆23Jan 28, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- AstroAgents: Multi-Agent AI for Hypothesis Generation from Mass Spectrometry Data☆15Apr 1, 2025Updated last year
- [ACL 2024] This is the code for our paper ”RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records“.☆42Sep 19, 2024Updated last year
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆19Jun 1, 2024Updated 2 years ago
- A curated list of LLM powered AI Agents in Biomedical Research. Medical Image Analysis, Multi-omics Genomics Analysis, Biomedical Scienti…☆77Sep 28, 2025Updated 9 months ago
- use angr to deobfuscation☆10Oct 8, 2019Updated 6 years ago
- ☆89Sep 11, 2024Updated last year
- ☆133Mar 23, 2025Updated last year