π Loong: Synthesize Long CoTs at Scale through Verifiers.
β505Jun 23, 2026Updated last week
Alternatives and similar repositories for loong
Users that are interested in loong are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/β424Jun 23, 2026Updated last week
- Lego for GRPOβ30May 27, 2025Updated last year
- π« CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.orgβ17,305Jun 28, 2026Updated last week
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoningβ41Nov 11, 2025Updated 7 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,734Apr 14, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ271May 5, 2025Updated last year
- Streamline on-policy/off-policy distillation workflows in a few lines of codeβ105Updated this week
- ποΈ OASIS: Open Agent Social Interaction Simulations with One Million Agents.β4,875Jun 23, 2026Updated last week
- Recipes to train the self-rewarding reasoning LLMs.β232Mar 2, 2025Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reiβ¦β1,402May 16, 2025Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspectiveβ1,265Aug 27, 2025Updated 10 months ago
- Simple & Scalable Pretraining for Neural Architecture Researchβ335Mar 31, 2026Updated 3 months ago
- β138Mar 20, 2025Updated last year
- The original Shared Recurrent Memory Transformer implementationβ36Jul 11, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Simple RL training for reasoningβ3,867Dec 23, 2025Updated 6 months ago
- β39Aug 4, 2025Updated 11 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Followingβ56Mar 30, 2026Updated 3 months ago
- Agentic testing for agentic codebasesβ907Updated this week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ5,051Nov 13, 2025Updated 7 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Predictionβ572May 6, 2025Updated last year
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ19,916Jun 23, 2026Updated last week
- Scalable RL solution for advanced reasoning of language modelsβ1,864Mar 18, 2025Updated last year
- Official Repo for Open-Reasoner-Zeroβ2,096Jun 2, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β28Apr 2, 2025Updated last year
- An open-source toolkit helping developers build natural language database query solutionsβ26May 5, 2025Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ133Jun 11, 2025Updated last year
- Awesome Reasoning LLM Tutorial/Survey/Guideβ2,464Apr 6, 2026Updated 2 months ago
- Parallel Scaling Law for Language Model β Beyond Parameter and Inference Time Scalingβ480May 17, 2025Updated last year
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]β696Jul 29, 2025Updated 11 months ago
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent wβ¦β113Sep 8, 2025Updated 9 months ago
- Fully open data curation for reasoning modelsβ2,295Dec 2, 2025Updated 7 months ago
- π€ The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"β118Apr 6, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- π» SETA: Scaling Environments for Terminal Agentsβ116Feb 16, 2026Updated 4 months ago
- πΎ OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.β664Jan 29, 2026Updated 5 months ago
- Democratizing Reinforcement Learning for LLMsβ5,677Updated this week
- Model Activity Visualiserβ523Apr 9, 2025Updated last year
- AI Powered Logo Generator | Powered by Nebius AIβ487Apr 22, 2026Updated 2 months ago
- A version of verl to support diverse tool use [TMLR 2026]β1,008Jun 8, 2026Updated 3 weeks ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Frameworkβ22,173Jun 27, 2026Updated last week