This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the list. Any suggestions and PRs are welcome!
☆230Feb 26, 2026Updated 2 months ago
Alternatives and similar repositories for Awesome-LLM-Agent-Optimization-Papers
Users who are interested in Awesome-LLM-Agent-Optimization-Papers are comparing it to the libraries listed below.
- ☆81May 14, 2026Updated last week
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- ☆15Oct 28, 2024Updated last year
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆784Sep 11, 2025Updated 8 months ago
- An RL-Friendly Vision-Language Model for Minecraft☆41Oct 17, 2024Updated last year
- Automatic prompt optimization framework for multi-step agent tasks.☆37Nov 12, 2024Updated last year
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) 🍓☆36Apr 3, 2025Updated last year
- A repo that lists papers related to LLM-based agents☆2,299Jul 12, 2025Updated 10 months ago
- ☆19Sep 19, 2024Updated last year
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban…☆30Jul 15, 2025Updated 10 months ago
- Survey on LLM Agents (Published at COLING 2025)☆501Oct 3, 2025Updated 7 months ago
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆60Jul 24, 2025Updated 10 months ago
- This is the repo for developing reasoning models in the financial domain, aiming to enhance models' capabilities in handling finan…☆75Jun 23, 2025Updated 11 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆157Dec 24, 2024Updated last year
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆150Feb 19, 2025Updated last year
- ☆27Feb 13, 2026Updated 3 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆82Jan 18, 2024Updated 2 years ago
- ☆18Oct 6, 2025Updated 7 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆754May 10, 2026Updated 2 weeks ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- ☆166Jan 21, 2025Updated last year
- ☆50Sep 6, 2025Updated 8 months ago
- Repo for "AlphaResearch: Accelerating New Algorithm Discovery with Language Models"☆56Nov 12, 2025Updated 6 months ago
- ☆24May 21, 2025Updated last year
- ☆25Jul 20, 2025Updated 10 months ago
- Paper list for Efficient Reasoning.☆886May 11, 2026Updated 2 weeks ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 9 months ago
- [EMNLP 2023 (Findings)] Schema-adaptable Knowledge Graph Construction☆22Jan 28, 2024Updated 2 years ago
- [ACL 2024] This is the code for our paper "RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records".☆41Sep 19, 2024Updated last year
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆19Jun 1, 2024Updated last year
- blade-chest model for matchup and comparison prediction☆14Jul 10, 2016Updated 9 years ago
- A curated list of LLM powered AI Agents in Biomedical Research. Medical Image Analysis, Multi-omics Genomics Analysis, Biomedical Scienti…☆75Sep 28, 2025Updated 7 months ago
- Use angr for deobfuscation☆10Oct 8, 2019Updated 6 years ago
- OptiBench and ReSocratic Synthesis Method☆34Oct 2, 2025Updated 7 months ago
- ☆124Mar 23, 2025Updated last year
- ☆176Oct 29, 2025Updated 6 months ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆23Sep 25, 2025Updated 8 months ago
- Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs (AAAI 2024)☆15Jul 30, 2024Updated last year
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆149Apr 9, 2025Updated last year