Simple-Efficient / RL-FactoryLinks
Train your Agent model via our easy and efficient framework
☆1,697Updated 2 months ago
Alternatives and similar repositories for RL-Factory
Users that are interested in RL-Factory are comparing it to the libraries listed below
Sorting:
- minimal-cost for training 0.5B R1-Zero☆806Updated 8 months ago
- adds Sequence Parallelism into LLaMA-Factory☆603Updated 3 months ago
- [COLM’25] DeepRetrieval — 🔥 Training Search Agent by RLVR with Retrieval Outcome☆695Updated 3 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆489Updated 3 months ago
- ☆559Updated 4 months ago
- ☆1,115Updated 2 weeks ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,471Updated this week
- ☆333Updated 5 months ago
- A scalable, end-to-end training pipeline for general-purpose agents☆365Updated 7 months ago
- A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models☆106Updated 2 months ago
- This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).☆378Updated 5 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,201Updated this week
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges☆2,402Updated 2 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆694Updated 3 months ago
- Align Anything: Training All-modality Model with Feedback☆4,625Updated 2 months ago
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork☆310Updated 4 months ago
- Recipes to train reward model for RLHF.☆1,512Updated 9 months ago
- ☆490Updated 3 months ago
- The official code of ARPO & AEPO☆880Updated last week
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆510Updated this week
- Codebase for Iterative DPO Using Rule-based Rewards☆267Updated 9 months ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆293Updated this week
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆683Updated 6 months ago
- The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.☆526Updated 6 months ago
- Awesome List for Agentic RL☆760Updated last month
- ☆427Updated 3 months ago
- A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)☆116Updated 2 months ago
- ☆761Updated last month
- When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification☆841Updated 2 months ago
- The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Eva…☆139Updated this week