camel-ai/loong

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/camel-ai/loong)

camel-ai / loong

🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.

☆506

Alternatives and similar repositories for loong

Users that are interested in loong are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

camel-ai / camel
View on GitHub
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
☆17,490Updated this week
facebookresearch / sweet_rl
View on GitHub
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆271May 5, 2025Updated last year
SalesforceAIResearch / PretrainRL-pipeline
View on GitHub
An automated data pipeline scaling RL to pretraining levels
☆76Jun 2, 2026Updated last month
mll-lab-nu / RAGEN
View on GitHub
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,756Updated this week
camel-ai / seta
View on GitHub
💻 SETA: Scaling Environments for Terminal Agents
☆127Jul 17, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Agent-RL / ReCall
View on GitHub
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…
☆1,421May 16, 2025Updated last year
GAIR-NLP / OctoThinker
View on GitHub
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189Jul 23, 2025Updated last year
THU-KEG / VerIF
View on GitHub
[EMNLP 2025] Verification Engineering for RL in Instruction Following
☆57Mar 30, 2026Updated 3 months ago
sail-sg / understand-r1-zero
View on GitHub
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,268Aug 27, 2025Updated 10 months ago
sail-sg / oat
View on GitHub
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆666Jan 29, 2026Updated 5 months ago
MoonshotAI / checkpoint-engine
View on GitHub
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆982Jul 4, 2026Updated 3 weeks ago
NovaSky-AI / SkyRL
View on GitHub
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,093Updated this week
hkust-nlp / simpleRL-reason
View on GitHub
Simple RL training for reasoning
☆3,870Dec 23, 2025Updated 7 months ago
TIGER-AI-Lab / General-Reasoner
View on GitHub
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆229Nov 27, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rllm-org / rllm
View on GitHub
Democratizing Reinforcement Learning for LLMs
☆5,731Updated this week
TIGER-AI-Lab / verl-tool
View on GitHub
A version of verl to support diverse tool use [TMLR 2026]
☆1,024Jul 15, 2026Updated last week
Open-Reasoner-Zero / Open-Reasoner-Zero
View on GitHub
Official Repo for Open-Reasoner-Zero
☆2,096Jun 2, 2025Updated last year
inclusionAI / PromptCoT
View on GitHub
A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…
☆132Jan 31, 2026Updated 5 months ago
OpenBMB / RLPR
View on GitHub
Extrapolating RLVR to General Domains without Verifiers
☆205Aug 12, 2025Updated 11 months ago
ChenxinAn-fdu / POLARIS
View on GitHub
Scaling RL on advanced reasoning models
☆692Oct 20, 2025Updated 9 months ago
EvanZhuang / knowledge_flow
View on GitHub
Official Implementation of Knowledge Flow Prompting
☆35Oct 20, 2025Updated 9 months ago
PrimeIntellect-ai / verifiers
View on GitHub
Our library for RL environments + evals
☆4,400Updated this week
RLHFlow / Self-rewarding-reasoning-LLM
View on GitHub
Recipes to train the self-rewarding reasoning LLMs.
☆231Mar 2, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Gen-Verse / Open-AgentRL
View on GitHub
RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios
☆592Jun 12, 2026Updated last month
PRIME-RL / PRIME
View on GitHub
Scalable RL solution for advanced reasoning of language models
☆1,865Mar 18, 2025Updated last year
THUDM / slime
View on GitHub
slime is an LLM post-training framework for RL Scaling.
☆7,629Updated this week
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,654Updated this week
huggingface / Math-Verify
View on GitHub
☆1,170Jan 10, 2026Updated 6 months ago
collinear-ai / spider
View on GitHub
Streamline on-policy/off-policy distillation workflows in a few lines of code
☆107Updated this week
GAIR-NLP / ProX
View on GitHub
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆271Jul 8, 2025Updated last year
PeterGriffinJin / Search-R1
View on GitHub
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,153Nov 13, 2025Updated 8 months ago
Aloriosa / srmt
View on GitHub
The original Shared Recurrent Memory Transformer implementation
☆36Jul 11, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
camel-ai / owl
View on GitHub
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
☆20,063Updated this week
areal-project / AReaL
View on GitHub
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
☆5,599Updated this week
allenai / olmes
View on GitHub
Reproducible, flexible LLM evaluations
☆390Mar 24, 2026Updated 4 months ago
open-tinker / OpenTinker
View on GitHub
OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆676Mar 21, 2026Updated 4 months ago
Simple-Efficient / RL-Factory
View on GitHub
Train your Agent model via our easy and efficient framework
☆1,773Dec 5, 2025Updated 7 months ago
microsoft / rStar
View on GitHub
☆1,422Sep 12, 2025Updated 10 months ago
open-thoughts / open-thoughts
View on GitHub
Fully open data curation for reasoning models
☆2,308Dec 2, 2025Updated 7 months ago