DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
☆285Oct 2, 2025Updated 4 months ago
Alternatives and similar repositories for DeepDive
Users that are interested in DeepDive are comparing it to the libraries listed below
Sorting:
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆134Jan 31, 2026Updated last month
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆68Dec 8, 2025Updated 2 months ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents☆33Feb 1, 2026Updated 3 weeks ago
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆184Dec 11, 2025Updated 2 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆37Oct 1, 2025Updated 4 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆121May 6, 2025Updated 9 months ago
- Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"☆36Dec 23, 2025Updated 2 months ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆632Feb 20, 2026Updated last week
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆358Jan 12, 2026Updated last month
- ☆13Nov 5, 2024Updated last year
- ☆34Dec 18, 2025Updated 2 months ago
- [ICLR 2026] Tree Search for LLM Agent Reinforcement Learning☆292Jan 26, 2026Updated last month
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 4 years ago
- Long Context Research☆26Jan 26, 2026Updated last month
- Code for "What really matters in matrix-whitening optimizers?"☆21Oct 31, 2025Updated 4 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,324May 16, 2025Updated 9 months ago
- ☆41May 22, 2025Updated 9 months ago
- Codebase accompanying the Summary of a Haystack paper.☆80Sep 20, 2024Updated last year
- MSTI☆16Mar 6, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- ☆14Apr 16, 2024Updated last year
- ☆15Jun 2, 2025Updated 8 months ago
- ☆28Feb 8, 2026Updated 2 weeks ago
- RewardAnything: Generalizable Principle-Following Reward Models☆45Jun 11, 2025Updated 8 months ago
- ☆335May 24, 2025Updated 9 months ago
- ☆29Apr 29, 2024Updated last year
- Demonstration of how to run multiple chains in Langchain Assyncronously☆12Jul 6, 2023Updated 2 years ago
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆74Jul 14, 2025Updated 7 months ago
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍☆13Apr 15, 2025Updated 10 months ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated 10 months ago
- Change Point Detection in Time Series☆14Mar 15, 2023Updated 2 years ago
- ☆16Sep 27, 2023Updated 2 years ago
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs""☆30Oct 12, 2025Updated 4 months ago
- ☆27Jun 5, 2025Updated 8 months ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆31Updated this week
- ☆17Mar 5, 2025Updated 11 months ago
- ☆105Mar 25, 2025Updated 11 months ago
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆106Sep 29, 2025Updated 5 months ago
- Code for KaLM-Embedding models☆114Jun 30, 2025Updated 8 months ago