R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
☆78May 25, 2025Updated last year
Alternatives and similar repositories for R1-Searcher-plus
Users that are interested in R1-Searcher-plus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [SIGIR '26] Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation☆41May 15, 2026Updated 2 weeks ago
- ☆25Dec 13, 2024Updated last year
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆712Aug 5, 2025Updated 9 months ago
- ☆33May 27, 2025Updated last year
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆39Feb 4, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 6 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆17Oct 20, 2025Updated 7 months ago
- This is the code of a agentic rag method with dynamic workflow.☆14Jan 22, 2026Updated 4 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆754May 10, 2026Updated 2 weeks ago
- ☆19Mar 23, 2025Updated last year
- [𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐩𝐮𝐭𝐚𝐭𝐢𝐨𝐧𝐚𝐥 𝐒𝐜𝐢𝐞𝐧𝐜𝐞] ⚡️ PSE/PSRN: Fast and efficient symbolic expression discovery through paralleliz…☆22May 17, 2026Updated last week
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 3 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆120Jun 3, 2025Updated 11 months ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆47Dec 17, 2025Updated 5 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search [SIGIR 2026]☆63Jul 4, 2025Updated 10 months ago
- [ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning☆31Jan 4, 2026Updated 4 months ago
- Official repository for RAG-Gym☆123Mar 4, 2025Updated last year
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆46Aug 25, 2025Updated 9 months ago
- Official implementation of the NeurIPS 2024 paper CORY☆33Mar 4, 2026Updated 2 months ago
- GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators☆59Dec 23, 2025Updated 5 months ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated last year
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆46Jun 24, 2025Updated 11 months ago
- Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.☆26Jun 6, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Repo for paper "Agentic-R: Learning to Retrieve for Agentic Search" (ACL 2026 Findings)☆83Apr 9, 2026Updated last month
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated last year
- ☆41Mar 6, 2026Updated 2 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆41Jun 24, 2025Updated 11 months ago
- survery of small language models☆18Jul 23, 2024Updated last year
- [ACL '24] Source code for paper: INTERVENOR : Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing☆30Nov 25, 2024Updated last year
- Some example codes for drawing figures in research paper☆35Mar 3, 2022Updated 4 years ago
- ☆19Jun 14, 2024Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [EMNLP '25] Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"☆41Aug 13, 2025Updated 9 months ago
- ☆26Oct 9, 2025Updated 7 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆51Sep 4, 2025Updated 8 months ago
- ☆16May 18, 2026Updated last week
- ☆16Jul 29, 2025Updated 10 months ago
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,224Nov 17, 2025Updated 6 months ago
- ☆11Apr 10, 2023Updated 3 years ago