RUC-NLPIR / Search-o1
🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]
☆1,084 Updated 2 months ago
Alternatives and similar repositories for Search-o1
Users who are interested in Search-o1 are comparing it to the repositories listed below.
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆1,241 Updated 5 months ago
- Integrating Tool Use into LLM Reasoning ☆692 Updated 8 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning ☆652 Updated 3 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆639 Updated 3 weeks ago
- Parsing-free RAG supported by VLMs ☆849 Updated 3 weeks ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning ☆874 Updated 3 months ago
- Large Reasoning Models ☆806 Updated 11 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi… ☆640 Updated 2 months ago
- ☆963 Updated 9 months ago
- [Up-to-date] Awesome Agentic Deep Research Resources ☆530 Updated 2 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆472 Updated 5 months ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching ☆1,184 Updated 2 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,527 Updated 5 months ago
- ☆1,335 Updated last month
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow. ☆773 Updated 3 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ☆786 Updated 7 months ago
- Awesome Deep Research list! For more details, please refer to our survey paper -- A Comprehensive Survey of Deep Research: Systems, Metho… ☆352 Updated 2 weeks ago
- ☆843 Updated 2 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents ☆457 Updated 3 months ago
- A series of technical reports on Slow Thinking with LLM ☆743 Updated 2 months ago
- This is the repository for the Tool Learning survey. ☆452 Updated 3 months ago
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation ☆309 Updated last year
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆3,484 Updated last week
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆485 Updated 2 months ago
- ☆413 Updated 3 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆668 Updated 4 months ago
- ☆1,348 Updated 11 months ago
- [COLM 2025] LIMO: Less is More for Reasoning ☆1,045 Updated 3 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,390 Updated this week
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation" ☆161 Updated 7 months ago