pat-jj / DeepRetrieval
[COLM'25] DeepRetrieval - 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning
☆666 · Updated last month
Alternatives and similar repositories for DeepRetrieval
Users who are interested in DeepRetrieval are comparing it to the repositories listed below.
- Adds Sequence Parallelism into LLaMA-Factory ☆588 · Updated last month
- (ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat… ☆299 · Updated 2 months ago
- ☆322 · Updated 2 months ago
- Complex Reasoning RAG System, Agentic RAG System ☆203 · Updated this week
- [EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data) ☆782 · Updated last week
- ☆502 · Updated last month
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework ☆289 · Updated 2 months ago
- [NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS ☆1,224 · Updated last month
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414 ☆442 · Updated 3 weeks ago
- [EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery ☆258 · Updated last week
- A scalable, end-to-end training pipeline for general-purpose agents ☆361 · Updated 4 months ago
- Train your Agent model via our easy and efficient framework ☆1,613 · Updated last week
- When Agent Becomes the Scientist - Building Closed-Loop System from Hypothesis to Verification ☆778 · Updated 3 weeks ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator" ☆554 · Updated 3 months ago
- ☆46 · Updated 7 months ago
- Codebase for Iterative DPO Using Rule-based Rewards ☆260 · Updated 7 months ago
- [NeurIPS 2025] A Graph-based LLM Framework for Real-world SE Tasks ☆487 · Updated last month
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098… ☆304 · Updated 3 months ago
- [NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications ☆772 · Updated last month
- ☆915 · Updated this week
- In-depth study of the graphrag ☆1,448 · Updated 4 months ago
- RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of aut… ☆424 · Updated last week
- [AAAI'26, Oral] Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learni… ☆39 · Updated 3 months ago
- MiroThinker: open-source agentic models trained for deep research and complex tool use scenarios. ☆527 · Updated last week
- ☆52 · Updated this week
- This repository contains the implementation of AutoSchemaKG, a novel framework for automatic knowledge graph construction that combines s… ☆597 · Updated 2 weeks ago
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models ☆61 · Updated 9 months ago
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models ☆147 · Updated 10 months ago
- ✨✨ R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning ☆267 · Updated 6 months ago
- GraphRAG-Bench, the official repo of comprehensive benchmark and dataset for evaluating GraphRAG models. ☆257 · Updated 2 weeks ago