ByteDance-Seed / Agent-R
Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
☆132Updated last month
Alternatives and similar repositories for Agent-R
Users that are interested in Agent-R are comparing it to the libraries listed below
Sorting:
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆193Updated last week
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆97Updated 6 months ago
- ☆65Updated 2 weeks ago
- ☆201Updated 2 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆107Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆92Updated 2 months ago
- ☆93Updated 3 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆202Updated this week
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆146Updated 3 weeks ago
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning☆261Updated this week
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆143Updated last week
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆223Updated 4 months ago
- ☆110Updated 3 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆59Updated last month
- ☆131Updated this week
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆179Updated last month
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆85Updated last month
- AWM: Agent Workflow Memory☆270Updated 3 months ago
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆73Updated last month
- ☆181Updated 3 weeks ago
- ☆168Updated last month
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆74Updated this week
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆42Updated 3 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆147Updated 2 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆53Updated last month
- Official code repository for Sketch-of-Thought (SoT)☆112Updated last week
- MPO: Boosting LLM Agents with Meta Plan Optimization☆52Updated 2 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- ☆153Updated 3 weeks ago