π Loong: Synthesize Long CoTs at Scale through Verifiers.
β502Apr 17, 2026Updated 2 weeks ago
Alternatives and similar repositories for loong
Users that are interested in loong are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/β416Apr 17, 2026Updated 2 weeks ago
- Lego for GRPOβ30May 27, 2025Updated 11 months ago
- π« CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.orgβ16,869Updated this week
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoningβ42Nov 11, 2025Updated 5 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,642Apr 14, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ266May 5, 2025Updated last year
- Streamline on-policy/off-policy distillation workflows in a few lines of codeβ101Feb 26, 2026Updated 2 months ago
- ποΈ OASIS: Open Agent Social Interaction Simulations with One Million Agents.β4,440Apr 17, 2026Updated 2 weeks ago
- Recipes to train the self-rewarding reasoning LLMs.β233Mar 2, 2025Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reiβ¦β1,380May 16, 2025Updated 11 months ago
- Understanding R1-Zero-Like Training: A Critical Perspectiveβ1,248Aug 27, 2025Updated 8 months ago
- β10Feb 14, 2025Updated last year
- Simple & Scalable Pretraining for Neural Architecture Researchβ329Mar 31, 2026Updated last month
- β138Mar 20, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- The original Shared Recurrent Memory Transformer implementationβ35Jul 11, 2025Updated 9 months ago
- Simple RL training for reasoningβ3,851Dec 23, 2025Updated 4 months ago
- β39Aug 4, 2025Updated 9 months ago
- Agentic testing for agentic codebasesβ869Updated this week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ4,609Nov 13, 2025Updated 5 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Followingβ54Mar 30, 2026Updated last month
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Predictionβ568May 6, 2025Updated 11 months ago
- π¦ OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automationβ19,719Apr 17, 2026Updated 2 weeks ago
- Scalable RL solution for advanced reasoning of language modelsβ1,852Mar 18, 2025Updated last year
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official Repo for Open-Reasoner-Zeroβ2,093Jun 2, 2025Updated 11 months ago
- β28Apr 2, 2025Updated last year
- An open-source toolkit helping developers build natural language database query solutionsβ26May 5, 2025Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ130Jun 11, 2025Updated 10 months ago
- Awesome Reasoning LLM Tutorial/Survey/Guideβ2,391Apr 6, 2026Updated 3 weeks ago
- Parallel Scaling Law for Language Model β Beyond Parameter and Inference Time Scalingβ479May 17, 2025Updated 11 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]β672Jul 29, 2025Updated 9 months ago
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent wβ¦β106Sep 8, 2025Updated 7 months ago
- Fully open data curation for reasoning modelsβ2,253Dec 2, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- π€ The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"β117Apr 6, 2025Updated last year
- πΎ OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.β652Jan 29, 2026Updated 3 months ago
- Model Activity Visualiserβ524Apr 9, 2025Updated last year
- AI Powered Logo Generator | Powered by Nebius AIβ481Apr 22, 2026Updated last week
- A version of verl to support diverse tool useβ968Mar 2, 2026Updated 2 months ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Frameworkβ21,046Updated this week
- Democratizing Reinforcement Learning for LLMsβ5,462Updated this week