π Loong: Synthesize Long CoTs at Scale through Verifiers.
β504May 15, 2026Updated 3 weeks ago
Alternatives and similar repositories for loong
Users that are interested in loong are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/β423Updated this week
- Lego for GRPOβ30May 27, 2025Updated last year
- π« CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.orgβ17,160Updated this week
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoningβ41Nov 11, 2025Updated 7 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,699Apr 14, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ268May 5, 2025Updated last year
- Streamline on-policy/off-policy distillation workflows in a few lines of codeβ105Feb 26, 2026Updated 3 months ago
- ποΈ OASIS: Open Agent Social Interaction Simulations with One Million Agents.β4,774May 15, 2026Updated 3 weeks ago
- Recipes to train the self-rewarding reasoning LLMs.β232Mar 2, 2025Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reiβ¦β1,389May 16, 2025Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspectiveβ1,261Aug 27, 2025Updated 9 months ago
- β10Feb 14, 2025Updated last year
- Simple & Scalable Pretraining for Neural Architecture Researchβ333Mar 31, 2026Updated 2 months ago
- β138Mar 20, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The original Shared Recurrent Memory Transformer implementationβ36Jul 11, 2025Updated 11 months ago
- Simple RL training for reasoningβ3,864Dec 23, 2025Updated 5 months ago
- β39Aug 4, 2025Updated 10 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Followingβ55Mar 30, 2026Updated 2 months ago
- Agentic testing for agentic codebasesβ895Updated this week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ4,925Nov 13, 2025Updated 7 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Predictionβ571May 6, 2025Updated last year
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ19,842Updated this week
- Scalable RL solution for advanced reasoning of language modelsβ1,862Mar 18, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official Repo for Open-Reasoner-Zeroβ2,097Jun 2, 2025Updated last year
- β28Apr 2, 2025Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ131Jun 11, 2025Updated last year
- Awesome Reasoning LLM Tutorial/Survey/Guideβ2,441Apr 6, 2026Updated 2 months ago
- Parallel Scaling Law for Language Model β Beyond Parameter and Inference Time Scalingβ478May 17, 2025Updated last year
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]β686Jul 29, 2025Updated 10 months ago
- Fully open data curation for reasoning modelsβ2,278Dec 2, 2025Updated 6 months ago
- π€ The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"β117Apr 6, 2025Updated last year
- πΎ OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.β661Jan 29, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Democratizing Reinforcement Learning for LLMsβ5,608Updated this week
- Model Activity Visualiserβ523Apr 9, 2025Updated last year
- AI Powered Logo Generator | Powered by Nebius AIβ485Apr 22, 2026Updated last month
- A version of verl to support diverse tool use [TMLR 2026]β994Updated this week
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Frameworkβ21,850Updated this week
- ContextGem: Effortless LLM extraction from documentsβ1,845Jun 6, 2026Updated last week
- Reproducible, flexible LLM evaluationsβ378Mar 24, 2026Updated 2 months ago