RUCAIBox / R1-Searcher
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
☆455Updated this week
Alternatives and similar repositories for R1-Searcher:
Users that are interested in R1-Searcher are comparing it to the libraries listed below
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆244Updated last week
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning☆676Updated last week
- A series of technical report on Slow Thinking with LLM☆644Updated last week
- ☆381Updated this week
- Large Reasoning Models☆802Updated 4 months ago
- ☆518Updated 3 months ago
- AN O1 REPLICATION FOR CODING☆334Updated 4 months ago
- ☆267Updated 8 months ago
- Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥☆249Updated 2 months ago
- ☆659Updated last week
- ☆135Updated 3 weeks ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆236Updated this week
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆300Updated last month
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆130Updated 3 months ago
- 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability☆140Updated last week
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆172Updated last week
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆97Updated last month
- ☆282Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆190Updated last month
- A Survey on Efficient Reasoning for LLMs☆319Updated 2 weeks ago
- Collect every awesome work about r1!☆341Updated 3 weeks ago
- ☆126Updated 3 months ago
- ☆630Updated 3 weeks ago
- ☆142Updated last month
- ☆920Updated 2 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆223Updated last week
- ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates☆373Updated 2 weeks ago
- The related works and background techniques about Openai o1☆219Updated 3 months ago
- This is the repository for the Tool Learning survey.☆359Updated last month
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆805Updated 3 weeks ago