camel-ai / loong
Loong: Synthesize Long CoTs at Scale through Verifiers.
☆272 · Updated this week
Alternatives and similar repositories for loong
Users who are interested in loong are comparing it to the libraries listed below
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆847 · Updated 2 weeks ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆363 · Updated last month
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning ☆183 · Updated last week
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks ☆193 · Updated last week
- free and open OpenAI Deep Research ☆549 · Updated 2 months ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Models ☆863 · Updated this week
- ☆181 · Updated 3 weeks ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆517 · Updated 2 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning ☆521 · Updated 3 weeks ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆377 · Updated 2 weeks ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't" ☆224 · Updated this week
- II-Researcher: a new open-source framework designed to aid building search / research agents ☆248 · Updated last week
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation ☆297 · Updated 6 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso… ☆107 · Updated last month
- ☆201 · Updated 2 months ago
- ☆181 · Updated last month
- ☆65 · Updated 2 weeks ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding ☆144 · Updated last month
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆230 · Updated 8 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆132 · Updated last month
- ☆93 · Updated 3 months ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching ☆788 · Updated this week
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking ☆448 · Updated 3 weeks ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆90 · Updated 2 months ago
- AWM: Agent Workflow Memory ☆270 · Updated 3 months ago
- WebWalker: Benchmarking LLMs in Web Traversal ☆396 · Updated 2 weeks ago
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects. ☆363 · Updated 3 weeks ago
- An agent benchmark with tasks in a simulated software company. ☆350 · Updated this week
- ☆155 · Updated last week
- Model Activity Visualiser ☆477 · Updated last month