π Loong: Synthesize Long CoTs at Scale through Verifiers.
β487Mar 4, 2026Updated this week
Alternatives and similar repositories for loong
Users that are interested in loong are comparing it to the libraries listed below
Sorting:
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoningβ42Nov 11, 2025Updated 3 months ago
- Lego for GRPOβ30May 27, 2025Updated 9 months ago
- π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/β403Updated this week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,527Feb 27, 2026Updated last week
- Streamline on-policy/off-policy distillation workflows in a few lines of codeβ96Feb 26, 2026Updated last week
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reiβ¦β1,338May 16, 2025Updated 9 months ago
- π« CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.orgβ16,172Updated this week
- ποΈ OASIS: Open Agent Social Interaction Simulations with One Million Agents.β2,539Updated this week
- Simple & Scalable Pretraining for Neural Architecture Researchβ309Dec 6, 2025Updated 3 months ago
- Understanding R1-Zero-Like Training: A Critical Perspectiveβ1,222Aug 27, 2025Updated 6 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ263May 5, 2025Updated 10 months ago
- β37Aug 4, 2025Updated 7 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Followingβ51Jan 5, 2026Updated 2 months ago
- Recipes to train the self-rewarding reasoning LLMs.β231Mar 2, 2025Updated last year
- β137Mar 20, 2025Updated 11 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Predictionβ568May 6, 2025Updated 10 months ago
- Simple RL training for reasoningβ3,830Dec 23, 2025Updated 2 months ago
- β10Feb 14, 2025Updated last year
- The original Shared Recurrent Memory Transformer implementationβ33Jul 11, 2025Updated 7 months ago
- β17Aug 5, 2025Updated 7 months ago
- Parallel Scaling Law for Language Model β Beyond Parameter and Inference Time Scalingβ474May 17, 2025Updated 9 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ4,135Nov 13, 2025Updated 3 months ago
- Scalable RL solution for advanced reasoning of language modelsβ1,811Mar 18, 2025Updated 11 months ago
- πΎ OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.β637Jan 29, 2026Updated last month
- Official Repo for Open-Reasoner-Zeroβ2,084Jun 2, 2025Updated 9 months ago
- Agentic testing for agentic codebasesβ785Mar 2, 2026Updated last week
- β18Dec 9, 2025Updated 3 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agentsβ584Mar 3, 2026Updated last week
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ125Jun 11, 2025Updated 8 months ago
- Fully open data curation for reasoning modelsβ2,225Dec 2, 2025Updated 3 months ago
- Awesome Reasoning LLM Tutorial/Survey/Guideβ2,314Oct 14, 2025Updated 4 months ago
- β335May 24, 2025Updated 9 months ago
- A version of verl to support diverse tool useβ889Mar 2, 2026Updated last week
- This repo documents my workflows and stack to run comfy ui GenANI assist under windowsβ31Feb 14, 2026Updated 3 weeks ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]β649Jul 29, 2025Updated 7 months ago
- π€ The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"β112Apr 6, 2025Updated 11 months ago
- The official repo for βUnleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problemβ [EMNLP25]β34Sep 1, 2025Updated 6 months ago
- β123Feb 21, 2025Updated last year
- Model Activity Visualiserβ521Apr 9, 2025Updated 11 months ago