pat-jj / DeepRetrieval
[COLM'25] DeepRetrieval - 🔥 Training Search Agent by RLVR with Retrieval Outcome
☆695 Updated 3 months ago
Alternatives and similar repositories for DeepRetrieval
Users that are interested in DeepRetrieval are comparing it to the libraries listed below
- adds Sequence Parallelism into LLaMA-Factory ☆603 Updated 3 months ago
- ☆333 Updated 5 months ago
- [EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data) ☆811 Updated 3 months ago
- Complex Reasoning Rag System, Agentic Rag System ☆248 Updated 2 weeks ago
- ☆559 Updated 4 months ago
- [NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS ☆1,238 Updated 2 weeks ago
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework ☆310 Updated 4 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414 ☆489 Updated 3 months ago
- Train your Agent model via our easy and efficient framework ☆1,697 Updated 2 months ago
- A scalable, end-to-end training pipeline for general-purpose agents ☆365 Updated 7 months ago
- Source of LinearRAG at ICLR'26 ☆333 Updated this week
- (ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat… ☆316 Updated 5 months ago
- [EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery ☆296 Updated 3 months ago
- When Agent Becomes the Scientist - Building Closed-Loop System from Hypothesis to Verification ☆841 Updated 2 months ago
- Codebase for Iterative DPO Using Rule-based Rewards ☆267 Updated 9 months ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator" ☆561 Updated 6 months ago
- [NeurIPS 2025] A Graph-based LLM Framework for Real-world SE Tasks ☆518 Updated 4 months ago
- ☆46 Updated 10 months ago
- In-depth study of the graphrag ☆1,507 Updated 7 months ago
- ☆59 Updated 2 weeks ago
- Open source code for ICLR 2026 Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions ☆216 Updated last week
- A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories. ☆177 Updated 6 months ago
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098… ☆317 Updated 6 months ago
- [NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications ☆1,084 Updated 3 weeks ago
- [ICLR 2026] Tree Search for LLM Agent Reinforcement Learning ☆276 Updated last week
- The official implementation of Self-Play Preference Optimization (SPPO) ☆582 Updated last year
- ☆1,115 Updated 2 weeks ago
- The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Eva… ☆139 Updated this week
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models ☆152 Updated last year
- ✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning ☆277 Updated 8 months ago