pat-jj / DeepRetrievalLinks
[COLMβ25] DeepRetrieval β π₯ Training Search Agent by RLVR with Retrieval Outcome
β696Updated 3 months ago
Alternatives and similar repositories for DeepRetrieval
Users that are interested in DeepRetrieval are comparing it to the libraries listed below
Sorting:
- adds Sequence Parallelism into LLaMA-Factoryβ604Updated this week
- [EMNLP'25] s3 - β‘ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)β816Updated 3 months ago
- β334Updated 5 months ago
- β559Updated 4 months ago
- Complex Reasoning Rag System, Agentic Rag Systemβ248Updated 2 weeks ago
- Source of LinearRAG at ICLR'26β333Updated last week
- β¨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framworkβ312Updated 5 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414β491Updated 3 months ago
- [NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTSβ1,238Updated 3 weeks ago
- (ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automatβ¦β318Updated 5 months ago
- Train your Agent model via our easy and efficient frameworkβ1,701Updated 2 months ago
- A scalable, end-to-end training pipeline for general-purpose agentsβ366Updated 7 months ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"β561Updated 6 months ago
- [EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discoveryβ296Updated 3 months ago
- When Agent Becomes the Scientist β Building Closed-Loop System from Hypothesis to Verificationβ849Updated 2 months ago
- Codebase for Iterative DPO Using Rule-based Rewardsβ267Updated 9 months ago
- In-depth study of the graphragβ1,509Updated 7 months ago
- β60Updated 3 weeks ago
- [NeurIPS 2025] A Graph-based LLM Framework for Real-world SE Tasksβ520Updated 4 months ago
- β46Updated 10 months ago
- [NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applicationsβ1,084Updated last month
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098β¦β317Updated 6 months ago
- [arXiv'25] EraRAG: Efficient and Incremental Retrieval-Augmented Generation for Growing Corporaβ170Updated 4 months ago
- [ICLR 2026] Tree Search for LLM Agent Reinforcement Learningβ282Updated 2 weeks ago
- β1,115Updated 3 weeks ago
- A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.β177Updated 7 months ago
- [AAAI'26, Oral] Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learniβ¦β43Updated 6 months ago
- β¨β¨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learningβ281Updated 9 months ago
- Open source code for ICLR 2026 Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactionsβ216Updated last week
- The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaβ¦β139Updated last week