GAIR-NLP / DeepResearcher
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆503 Updated 2 months ago
Alternatives and similar repositories for DeepResearcher
Users who are interested in DeepResearcher are comparing it to the repositories listed below.
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning ☆588 Updated last month
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning ☆637 Updated last month
- ☆266 Updated last month
- ☆270 Updated last month
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆1,064 Updated last month
- A series of technical reports on Slow Thinking with LLM ☆706 Updated last month
- Awesome Agent Training ☆179 Updated this week
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso… ☆121 Updated 3 months ago
- AN O1 REPLICATION FOR CODING ☆335 Updated 7 months ago
- ☆728 Updated last month
- ☆543 Updated 6 months ago
- ☆147 Updated 5 months ago
- ☆238 Updated last month
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling ☆447 Updated last week
- ☆264 Updated last year
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆197 Updated last week
- This is the repository for the Tool Learning survey. ☆403 Updated last month
- a-m-team's exploration in large language modeling ☆171 Updated last month
- Collect every awesome work about r1! ☆394 Updated 2 months ago
- Build, evaluate and train General Multi-Agent Assistance with ease ☆333 Updated this week
- Search-o1: Agentic Search-Enhanced Large Reasoning Models ☆976 Updated last month
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering ☆192 Updated 2 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆420 Updated last month
- ☆609 Updated last month
- ☆138 Updated 2 months ago
- Awesome Deep Research list ☆197 Updated 2 weeks ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆145 Updated 6 months ago
- A live reading list for LLM-synthetic-data. ☆307 Updated this week
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base. ☆205 Updated this week
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the… ☆120 Updated this week