agentica-project / rllmLinks

☆268

Alternatives and similar repositories for rllm

Users that are interested in rllm are comparing it to the libraries listed below

Sorting:

inclusionAI / ASearcher
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
☆492Updated last month
facebookresearch / sweet_rl
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆250Updated 6 months ago
OPPO-PersonalAI / Agent_Foundation_Models
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.
☆490Updated 2 months ago
WooooDyy / AgentGym-RL
Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…
☆495Updated 2 months ago
ReTool-RL / ReTool
☆231Updated 3 months ago
facebookresearch / meta-agents-research-environments
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…
☆364Updated this week
TsinghuaC3I / MARTI
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
☆348Updated this week
sunblaze-ucb / Intuitor
Code for the paper: "Learning to Reason without External Rewards"
☆375Updated 4 months ago
zwhe99 / DeepMath
A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
☆273Updated last month
ypwang61 / One-Shot-RLVR
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
☆376Updated last month
ByteDance-Seed / Agent-R
Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
☆161Updated last month
knoveleng / open-rs
Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
☆268Updated last month
TIGER-AI-Lab / verl-tool
A version of verl to support diverse tool use
☆701Updated this week
GAIR-NLP / ToRL
☆309Updated 5 months ago
axon-rl / gem
A Gym for Agentic LLMs
☆361Updated last week
cmu-l3 / l1
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
☆257Updated 6 months ago
SWE-bench / SWE-smith
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
☆463Updated this week
MiniMax-AI / SynLogic
[NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
☆186Updated 4 months ago
OPPO-PersonalAI / OAgents
Implementation for OAgents: An Empirical Study of Building Effective Agents
☆282Updated last month
StonyBrookNLP / appworld
🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource…
☆312Updated last week
ltzheng / SimpleTIR
End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆318Updated 2 months ago
ChenxinAn-fdu / POLARIS
Scaling RL on advanced reasoning models
☆632Updated last month
Gen-Verse / ReasonFlux
[NeurIPS 2025 Spotlight] ReasonFlux (long-CoT), ReasonFlux-PRM (process reward model) and ReasonFlux-Coder (code generation)
☆501Updated last month
vsubramaniam851 / multiagent-ft
☆222Updated 8 months ago
ruixin31 / Spurious_Rewards
☆341Updated 3 months ago
ADaM-BJTU / O1-CODER
AN O1 REPLICATION FOR CODING
☆337Updated 11 months ago
facebookresearch / swe-rl
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆618Updated 8 months ago
camel-ai / loong
🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.
☆460Updated last month
eddycmu / demystify-long-cot
☆326Updated 5 months ago
BytedTsinghua-SIA / MemAgent
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
☆785Updated 3 months ago