Loong: Synthesize Long CoTs at Scale through Verifiers.
★503 · May 15, 2026 · Updated last week
Alternatives and similar repositories for loong
Users that are interested in loong are comparing it to the libraries listed below.
- CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/ · ★421 · May 15, 2026 · Updated last week
- Lego for GRPO · ★30 · May 27, 2025 · Updated 11 months ago
- CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org · ★16,988 · May 18, 2026 · Updated last week
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning · ★41 · Nov 11, 2025 · Updated 6 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. · ★2,668 · Apr 14, 2026 · Updated last month
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks · ★268 · May 5, 2025 · Updated last year
- Streamline on-policy/off-policy distillation workflows in a few lines of code · ★103 · Feb 26, 2026 · Updated 3 months ago
- OASIS: Open Agent Social Interaction Simulations with One Million Agents. · ★4,606 · May 15, 2026 · Updated last week
- Recipes to train the self-rewarding reasoning LLMs. · ★232 · Mar 2, 2025 · Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei… · ★1,385 · May 16, 2025 · Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspective · ★1,259 · Aug 27, 2025 · Updated 8 months ago
- ★10 · Feb 14, 2025 · Updated last year
- Simple & Scalable Pretraining for Neural Architecture Research · ★332 · Mar 31, 2026 · Updated last month
- ★138 · Mar 20, 2025 · Updated last year
- The original Shared Recurrent Memory Transformer implementation · ★36 · Jul 11, 2025 · Updated 10 months ago
- Simple RL training for reasoning · ★3,859 · Dec 23, 2025 · Updated 5 months ago
- ★39 · Aug 4, 2025 · Updated 9 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Following · ★55 · Mar 30, 2026 · Updated last month
- Agentic testing for agentic codebases · ★880 · May 17, 2026 · Updated last week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL · ★4,753 · Nov 13, 2025 · Updated 6 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction · ★570 · May 6, 2025 · Updated last year
- OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation · ★19,799 · May 15, 2026 · Updated last week
- Scalable RL solution for advanced reasoning of language models · ★1,859 · Mar 18, 2025 · Updated last year
- Official Repo for Open-Reasoner-Zero · ★2,091 · Jun 2, 2025 · Updated 11 months ago
- An open-source toolkit helping developers build natural language database query solutions · ★26 · May 5, 2025 · Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems · ★131 · Jun 11, 2025 · Updated 11 months ago
- Awesome Reasoning LLM Tutorial/Survey/Guide · ★2,416 · Apr 6, 2026 · Updated last month
- Parallel Scaling Law for Language Model - Beyond Parameter and Inference Time Scaling · ★479 · May 17, 2025 · Updated last year
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] · ★679 · Jul 29, 2025 · Updated 9 months ago
- SETA: Scaling Environments for Terminal Agents · ★105 · Feb 16, 2026 · Updated 3 months ago
- Fully open data curation for reasoning models · ★2,261 · Dec 2, 2025 · Updated 5 months ago
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w… · ★110 · Sep 8, 2025 · Updated 8 months ago
- The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?" · ★117 · Apr 6, 2025 · Updated last year
- OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. · ★657 · Jan 29, 2026 · Updated 3 months ago
- Democratizing Reinforcement Learning for LLMs · ★5,548 · Updated this week
- Model Activity Visualiser · ★524 · Apr 9, 2025 · Updated last year
- AI Powered Logo Generator | Powered by Nebius AI · ★483 · Apr 22, 2026 · Updated last month
- A version of verl to support diverse tool use · ★982 · Mar 2, 2026 · Updated 2 months ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework · ★21,514 · Updated this week