RAGEN-AI / VAGEN
☆121Updated this week
Alternatives and similar repositories for VAGEN:
Users who are interested in VAGEN are comparing it to the libraries listed below.
- ☆153Updated last month
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"☆188Updated this week
- Repo of paper "Free Process Rewards without Process Labels"☆146Updated last month
- ☆132Updated 2 weeks ago
- ☆173Updated last month
- ☆163Updated this week
- ☆102Updated last month
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆113Updated this week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆198Updated this week
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 5 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆93Updated last month
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆72Updated 2 weeks ago
- ☆109Updated 3 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆138Updated 6 months ago
- ☆165Updated last month
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆135Updated 5 months ago
- A comprehensive collection of process reward models.☆76Updated this week
- A Comprehensive Survey on Long Context Language Modeling☆139Updated last month
- ☆287Updated last month
- Official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆118Updated this week
- Towards Large Multimodal Models as Visual Foundation Agents☆210Updated 2 weeks ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆174Updated last month
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆93Updated this week
- ☆194Updated 2 months ago
- repo for paper https://arxiv.org/abs/2504.13837☆117Updated 2 weeks ago
- Natural Language Reinforcement Learning☆87Updated 4 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆89Updated 2 months ago
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning☆180Updated this week
- MPO: Boosting LLM Agents with Meta Plan Optimization☆51Updated 2 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆94Updated 3 weeks ago