camel-ai / loongView external linksLinks
π Loong: Synthesize Long CoTs at Scale through Verifiers.
β485Feb 6, 2026Updated last week
Alternatives and similar repositories for loong
Users that are interested in loong are comparing it to the libraries listed below
Sorting:
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoningβ42Nov 11, 2025Updated 3 months ago
- Lego for GRPOβ30May 27, 2025Updated 8 months ago
- π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/β396Feb 6, 2026Updated last week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,511Jan 25, 2026Updated 3 weeks ago
- Streamline on-policy/off-policy distillation workflows in a few lines of codeβ95Feb 5, 2026Updated last week
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reiβ¦β1,317May 16, 2025Updated 8 months ago
- ποΈ OASIS: Open Agent Social Interaction Simulations with One Million Agents.β2,462Updated this week
- π« CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.orgβ16,021Updated this week
- Simple & Scalable Pretraining for Neural Architecture Researchβ308Dec 6, 2025Updated 2 months ago
- Understanding R1-Zero-Like Training: A Critical Perspectiveβ1,205Aug 27, 2025Updated 5 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ261May 5, 2025Updated 9 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Followingβ50Jan 5, 2026Updated last month
- β37Aug 4, 2025Updated 6 months ago
- Recipes to train the self-rewarding reasoning LLMs.β229Mar 2, 2025Updated 11 months ago
- β137Mar 20, 2025Updated 10 months ago
- Simple RL training for reasoningβ3,830Dec 23, 2025Updated last month
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Predictionβ567May 6, 2025Updated 9 months ago
- The original Shared Recurrent Memory Transformer implementationβ33Jul 11, 2025Updated 7 months ago
- β10Feb 14, 2025Updated last year
- β17Aug 5, 2025Updated 6 months ago
- Parallel Scaling Law for Language Model β Beyond Parameter and Inference Time Scalingβ469May 17, 2025Updated 8 months ago
- Scalable RL solution for advanced reasoning of language modelsβ1,803Mar 18, 2025Updated 10 months ago
- Agentic testing for agentic codebasesβ721Updated this week
- πΎ OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.β627Jan 29, 2026Updated 2 weeks ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ3,975Nov 13, 2025Updated 3 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agentsβ551Updated this week
- Official Repo for Open-Reasoner-Zeroβ2,087Jun 2, 2025Updated 8 months ago
- β18Dec 9, 2025Updated 2 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ125Jun 11, 2025Updated 8 months ago
- Fully open data curation for reasoning modelsβ2,206Dec 2, 2025Updated 2 months ago
- A version of verl to support diverse tool useβ868Jan 6, 2026Updated last month
- Awesome Reasoning LLM Tutorial/Survey/Guideβ2,286Oct 14, 2025Updated 4 months ago
- β334May 24, 2025Updated 8 months ago
- This repo documents my workflows and stack to run comfy ui GenANI assist under windowsβ30Jan 25, 2026Updated 2 weeks ago
- π€ The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"β110Apr 6, 2025Updated 10 months ago
- The official repo for βUnleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problemβ [EMNLP25]β34Sep 1, 2025Updated 5 months ago
- β123Feb 21, 2025Updated 11 months ago
- Model Activity Visualiserβ521Apr 9, 2025Updated 10 months ago
- Spec-driven thinking, nano-sized docs. Lightweight task specification for AI-assisted development.β36Jan 25, 2026Updated 3 weeks ago