pat-jj / DeepRetrievalLinks
[COLM'25] DeepRetrieval - π₯ Training Search Agent with Retrieval Outcomes via Reinforcement Learning
β600Updated last month
Alternatives and similar repositories for DeepRetrieval
Users that are interested in DeepRetrieval are comparing it to the libraries listed below
Sorting:
- adds Sequence Parallelism into LLaMA-Factoryβ535Updated last week
- Train your Agent model via our easy and efficient frameworkβ1,305Updated last week
- (ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automatβ¦β276Updated last month
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTSβ1,206Updated 4 months ago
- β¨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framworkβ248Updated 2 weeks ago
- A scalable, end-to-end training pipeline for general-purpose agentsβ346Updated 3 weeks ago
- From Automation to Autonomy: A Survey on Large Language Models in Scientific Discoveryβ205Updated 3 weeks ago
- This repository contains the implementation of AutoSchemaKG, a novel framework for automatic knowledge graph construction that combines sβ¦β436Updated this week
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challengesβ1,302Updated 3 weeks ago
- β45Updated 4 months ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"β517Updated this week
- Codebase for Iterative DPO Using Rule-based Rewardsβ253Updated 3 months ago
- In-depth study of the graphragβ1,382Updated last month
- BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?β765Updated 3 weeks ago
- β¨β¨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learningβ246Updated 2 months ago
- When Agent Becomes the Scientist β Building Closed-Loop System from Hypothesis to Verificationβ478Updated this week
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Modelsβ142Updated 6 months ago
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098β¦β302Updated this week
- β359Updated last month
- An MBTI Exploration of Large Language Modelsβ492Updated last year
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"β745Updated 2 months ago
- β63Updated 4 months ago
- R-KV: Redundancy-aware KV Cache Compression for Reasoning Modelsβ1,097Updated 3 weeks ago
- β210Updated last week
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Modelsβ52Updated 6 months ago
- Pytorch Library for Relational Table Learning with LLMs.β432Updated 3 weeks ago
- π EvoAgentX: Building a Self-Evolving Ecosystem of AI Agentsβ1,045Updated this week
- s3 - Efficient Yet Effective Search Agent Training via RL for RAGβ480Updated this week
- The official implementation of Self-Play Preference Optimization (SPPO)β570Updated 6 months ago
- Unified KV Cache Compression Methods for Auto-Regressive Modelsβ1,216Updated 6 months ago