π Loong: Synthesize Long CoTs at Scale through Verifiers.
β499Apr 1, 2026Updated last week
Alternatives and similar repositories for loong
Users that are interested in loong are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/β412Apr 1, 2026Updated last week
- π« CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.orgβ16,619Updated this week
- Lego for GRPOβ30May 27, 2025Updated 10 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoningβ42Nov 11, 2025Updated 4 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,576Mar 28, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ264May 5, 2025Updated 11 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of codeβ98Feb 26, 2026Updated last month
- ποΈ OASIS: Open Agent Social Interaction Simulations with One Million Agents.β4,183Apr 1, 2026Updated last week
- Recipes to train the self-rewarding reasoning LLMs.β231Mar 2, 2025Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reiβ¦β1,360May 16, 2025Updated 10 months ago
- Understanding R1-Zero-Like Training: A Critical Perspectiveβ1,239Aug 27, 2025Updated 7 months ago
- The original Shared Recurrent Memory Transformer implementationβ34Jul 11, 2025Updated 8 months ago
- Simple & Scalable Pretraining for Neural Architecture Researchβ325Mar 31, 2026Updated last week
- β10Feb 14, 2025Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β137Mar 20, 2025Updated last year
- Simple RL training for reasoningβ3,846Dec 23, 2025Updated 3 months ago
- β39Aug 4, 2025Updated 8 months ago
- An open-source toolkit helping developers build natural language database query solutionsβ26May 5, 2025Updated 11 months ago
- Agentic testing for agentic codebasesβ813Updated this week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ4,355Nov 13, 2025Updated 4 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Followingβ53Mar 30, 2026Updated last week
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Predictionβ568May 6, 2025Updated 11 months ago
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ19,318Apr 1, 2026Updated last week
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Scalable RL solution for advanced reasoning of language modelsβ1,841Mar 18, 2025Updated last year
- Official Repo for Open-Reasoner-Zeroβ2,087Jun 2, 2025Updated 10 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ126Jun 11, 2025Updated 9 months ago
- Awesome Reasoning LLM Tutorial/Survey/Guideβ2,343Oct 14, 2025Updated 5 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]β656Jul 29, 2025Updated 8 months ago
- Parallel Scaling Law for Language Model β Beyond Parameter and Inference Time Scalingβ478May 17, 2025Updated 10 months ago
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent wβ¦β103Sep 8, 2025Updated 7 months ago
- Fully open data curation for reasoning modelsβ2,237Dec 2, 2025Updated 4 months ago
- π€ The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"β117Apr 6, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- πΎ OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.β645Jan 29, 2026Updated 2 months ago
- Model Activity Visualiserβ520Apr 9, 2025Updated last year
- A version of verl to support diverse tool useβ947Mar 2, 2026Updated last month
- AI Powered Logo Generator | Powered by Nebius AIβ479May 16, 2025Updated 10 months ago
- verl: Volcano Engine Reinforcement Learning for LLMsβ20,443Apr 3, 2026Updated last week
- Democratizing Reinforcement Learning for LLMsβ5,363Apr 3, 2026Updated last week
- ContextGem: Effortless LLM extraction from documentsβ1,822Mar 16, 2026Updated 3 weeks ago