bytedance / Agent-R
Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
☆112Updated last week
Alternatives and similar repositories for Agent-R:
Users that are interested in Agent-R are comparing it to the libraries listed below
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆86Updated 5 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆83Updated this week
- AWM: Agent Workflow Memory☆252Updated last month
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆68Updated last week
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆75Updated 2 weeks ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆193Updated this week
- ☆82Updated last month
- ☆185Updated last month
- ☆103Updated 2 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆148Updated last week
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆177Updated last week
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆62Updated last month
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆131Updated last month
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆209Updated this week
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆120Updated 3 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆78Updated 3 weeks ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆215Updated 2 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆330Updated last month
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆108Updated 3 weeks ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆165Updated this week
- ☆102Updated 3 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning☆306Updated 3 weeks ago
- ☆260Updated last week
- An implemtation of Everyting of Thoughts (XoT).☆141Updated last year
- Code for the paper 🌳 Tree Search for Language Model Agents☆186Updated 8 months ago
- ☆111Updated last month
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆74Updated 2 weeks ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆48Updated last month
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆106Updated last month
- MPO: Boosting LLM Agents with Meta Plan Optimization☆40Updated 2 weeks ago