camel-ai / loongLinks
π Loong: Synthesize Long CoTs at Scale through Verifiers.
β478Updated this week
Alternatives and similar repositories for loong
Users that are interested in loong are comparing it to the libraries listed below
Sorting:
- β308Updated 3 months ago
- [Up-to-date] Awesome Agentic Deep Research Resourcesβ586Updated 4 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agentsβ530Updated last month
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agentsβ522Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemenβ¦β539Updated 3 months ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation modelsβ499Updated this week
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.β511Updated 3 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"β166Updated 2 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.β845Updated 5 months ago
- β254Updated 4 months ago
- [EMNLP 2025] Awesome RAG Reasoning Resourcesβ369Updated 5 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agentsβ298Updated 2 months ago
- β789Updated 2 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike statβ¦β407Updated last month
- Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Researchβ483Updated this week
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"β645Updated 9 months ago
- β867Updated 4 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.β355Updated 6 months ago
- AgentFlow: In-the-Flow Agentic System Optimizationβ1,461Updated 2 weeks ago
- π οΈ DeepAgent: A General Reasoning Agent with Scalable Toolsetsβ906Updated this week
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"β542Updated 2 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Predictionβ566Updated 7 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux (long-CoT), ReasonFlux-PRM (process reward model) and ReasonFlux-Coder (code generation)β512Updated 3 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.β679Updated 2 months ago
- Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making.β519Updated last month
- Latent Collaboration in Multi-Agent Systemsβ668Updated last week
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RLβ489Updated 6 months ago
- β310Updated 5 months ago
- π©ββοΈ Agent-as-a-Judge: The Magic for Open-Endednessβ696Updated 7 months ago
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scalingβ624Updated last month