GAIR-NLP / DeepResearcher
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆401Updated last month
Alternatives and similar repositories for DeepResearcher
Users who are interested in DeepResearcher are comparing it to the libraries listed below.
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆535Updated last week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆491Updated 2 weeks ago
- ☆191Updated last week
- ☆175Updated last month
- A series of technical reports on Slow Thinking with LLM☆679Updated this week
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning☆873Updated 2 weeks ago
- ☆139Updated 4 months ago
- Awesome Agent Training☆128Updated last week
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆142Updated 2 months ago
- ☆198Updated last week
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆387Updated last month
- Latest Advances on Long Chain-of-Thought Reasoning☆329Updated last week
- ☆237Updated last year
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆327Updated last month
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆107Updated 2 months ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆892Updated 2 weeks ago
- This is the repository for the Tool Learning survey.☆383Updated last week
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆248Updated 2 weeks ago
- ☆538Updated 4 months ago
- AN O1 REPLICATION FOR CODING☆336Updated 5 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆141Updated 5 months ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆201Updated 3 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the…☆98Updated 2 weeks ago
- ☆282Updated 10 months ago
- Collect every awesome work about r1!☆369Updated 3 weeks ago
- The related works and background techniques about OpenAI o1☆221Updated 4 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.☆238Updated last month
- ReasonFlux Series - Open-Sourced Strong Reasoning LLMs☆390Updated this week
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆224Updated 4 months ago
- ☆150Updated last month