RUCAIBox / R1-Searcher-plusView external linksLinks
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
☆72May 25, 2025Updated 8 months ago
Alternatives and similar repositories for R1-Searcher-plus
Users that are interested in R1-Searcher-plus are comparing it to the libraries listed below
Sorting:
- ☆25Dec 13, 2024Updated last year
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning".☆38Aug 22, 2025Updated 5 months ago
- This is the code of a agentic rag method with dynamic workflow.☆13Jan 22, 2026Updated 3 weeks ago
- ☆16Feb 22, 2025Updated 11 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆699Oct 15, 2025Updated 4 months ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆32Feb 4, 2026Updated last week
- ☆18Mar 23, 2025Updated 10 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 9 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Feb 4, 2026Updated last week
- ☆25Jan 4, 2026Updated last month
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆118Jun 3, 2025Updated 8 months ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated 11 months ago
- FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data (NAACL 2025)☆14Jul 14, 2025Updated 7 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 3 months ago
- Official repository for RAG-Gym☆121Mar 4, 2025Updated 11 months ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 2 months ago
- Code and data for paper "(How) do Language Models Track State?"☆21Mar 31, 2025Updated 10 months ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆45Dec 17, 2025Updated last month
- Python code for the paper Machine Learning to Improve Situational Awareness in Beyond Visual Range Air Combat.☆20Jul 9, 2022Updated 3 years ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆20Apr 3, 2025Updated 10 months ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆28Nov 4, 2025Updated 3 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆27Mar 1, 2025Updated 11 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆53Aug 28, 2025Updated 5 months ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆62Jul 4, 2025Updated 7 months ago
- ☆18Jun 14, 2024Updated last year
- GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators☆46Dec 23, 2025Updated last month
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- Exploration of automated dataset selection approaches at large scales.☆52Mar 4, 2025Updated 11 months ago
- ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation☆57Feb 2, 2026Updated last week
- ☆24Apr 3, 2025Updated 10 months ago
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆40Jul 21, 2025Updated 6 months ago
- Official Repository for Task-Circuit Quantization☆24Jun 1, 2025Updated 8 months ago
- Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.☆25Jun 6, 2025Updated 8 months ago
- ☆17Aug 1, 2025Updated 6 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆41Aug 25, 2025Updated 5 months ago
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆25Feb 18, 2025Updated 11 months ago
- ☆45May 27, 2025Updated 8 months ago
- This is a repository for my work on the paper "Oracle Guided Image Synthesis with Relative Queries".☆24May 6, 2022Updated 3 years ago