bytedance / Agent-R

Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"

☆112

Alternatives and similar repositories for Agent-R:

Users that are interested in Agent-R are comparing it to the libraries listed below

siyuyuan / evoagent
Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"
☆86Updated 5 months ago
facebookresearch / sweet_rl
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆83Updated this week
zorazrw / agent-workflow-memory
AWM: Agent Workflow Memory
☆252Updated last month
ADaM-BJTU / AutoCoA
AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…
☆68Updated last week
THU-KEG / Agentic-Reward-Modeling
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆75Updated 2 weeks ago
OSU-NLP-Group / UGround
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆193Updated this week
satori-reasoning / Satori
☆82Updated last month
vsubramaniam851 / multiagent-ft
☆185Updated last month
jwhj / OREO
☆103Updated 2 months ago
cmu-l3 / l1
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
☆148Updated last week
diagram-of-thought / diagram-of-thought
Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)
☆177Updated last week
zjunlp / WorfBench
[ICLR 2025] Benchmarking Agentic Workflow Generation
☆62Updated last month
TIGER-AI-Lab / CritiqueFineTuning
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"
☆131Updated last month
facebookresearch / RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆209Updated this week
zjunlp / WKM
[NeurIPS 2024] Agent Planning with World Knowledge Model
☆120Updated 3 months ago
Yu-Fangxu / FoR
Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples
☆78Updated 3 weeks ago
zjunlp / AutoAct
[ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
☆215Updated 2 months ago
THUDM / WebRL
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
☆330Updated last month
kyegomez / Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
☆108Updated 3 weeks ago
StonyBrookNLP / appworld
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…
☆165Updated this week
Open-Source-O1 / o1_Reasoning_Patterns_Study
☆102Updated 3 months ago
Agent-RL / ReSearch
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
☆306Updated 3 weeks ago
eddycmu / demystify-long-cot
☆260Updated last week
microsoft / Everything-of-Thoughts-XoT
An implemtation of Everyting of Thoughts (XoT).
☆141Updated last year
kohjingyu / search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
☆186Updated 8 months ago
SalesforceAIResearch / LaTRO
☆111Updated last month
CMU-AIRe / MRT
Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".
☆74Updated 2 weeks ago
zou-group / sirius
SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning
☆48Updated last month
AgnostiqHQ / multi-agent-llm
Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)
☆106Updated last month
WeiminXiong / MPO
MPO: Boosting LLM Agents with Meta Plan Optimization
☆40Updated 2 weeks ago