camel-ai / loongLinks
π Loong: Synthesize Long CoTs at Scale through Verifiers.
β429Updated 2 weeks ago
Alternatives and similar repositories for loong
Users that are interested in loong are comparing it to the libraries listed below
Sorting:
- β404Updated last week
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemenβ¦β342Updated last week
- [Up-to-date] Awesome Agentic Deep Research Resourcesβ462Updated 3 weeks ago
- [EMNLP 2025] Awesome RAG Reasoning Resourcesβ295Updated last month
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"β160Updated 3 months ago
- β166Updated last month
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.β406Updated last week
- β472Updated 2 weeks ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.β665Updated last month
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agentsβ389Updated last month
- Implementation for OAgents: An Empirical Study of Building Effective Agentsβ258Updated 3 weeks ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"β147Updated this week
- Scaling Data for SWE-agentsβ399Updated last week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ244Updated 4 months ago
- β803Updated 3 weeks ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.β596Updated 5 months ago
- β274Updated last month
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.β340Updated 2 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Dataβ151Updated this week
- β205Updated last month
- β214Updated 6 months ago
- Repository for Zochi's Researchβ267Updated 3 weeks ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Predictionβ548Updated 4 months ago
- AWM: Agent Workflow Memoryβ321Updated 7 months ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)β618Updated last week
- Code for the paper: "Learning to Reason without External Rewards"β354Updated 2 months ago
- Atom of Thoughts for Markov LLM Test-Time Scalingβ586Updated 3 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoningβ252Updated 3 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"β261Updated 4 months ago
- A benchmark for LLMs on complicated tasks in the terminalβ691Updated this week