pat-jj / DeepRetrievalLinks

[COLM'25] DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning

☆600

Alternatives and similar repositories for DeepRetrieval

Users that are interested in DeepRetrieval are comparing it to the libraries listed below

Sorting:

Qihoo360 / 360-LLaMA-Factory
adds Sequence Parallelism into LLaMA-Factory
☆535Updated last week
Simple-Efficient / RL-Factory
Train your Agent model via our easy and efficient framework
☆1,305Updated last week
Alpha-Innovator / SurveyForge
(ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat…
☆276Updated last month
HJYao00 / Mulberry
Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
☆1,206Updated 4 months ago
KodCode-AI / kodcode
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork
☆248Updated 2 weeks ago
cmriat / l0
A scalable, end-to-end training pipeline for general-purpose agents
☆346Updated 3 weeks ago
HKUST-KnowComp / Awesome-LLM-Scientific-Discovery
From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery
☆205Updated 3 weeks ago
HKUST-KnowComp / AutoSchemaKG
This repository contains the implementation of AutoSchemaKG, a novel framework for automatic knowledge graph construction that combines s…
☆436Updated this week
luo-junyu / Awesome-Agent-Papers
[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges
☆1,302Updated 3 weeks ago
mlpod / OpenSFT
☆45Updated 4 months ago
HKUDS / SepLLM
[ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"
☆517Updated this week
RLHFlow / Online-DPO-R1
Codebase for Iterative DPO Using Rule-based Rewards
☆253Updated 3 months ago
JayLZhou / GraphRAG
In-depth study of the graphrag
☆1,382Updated last month
bird-bench / BIRD-CRITIC-1
BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?
☆765Updated 3 weeks ago
yfzhang114 / r1_reward
✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
☆246Updated 2 months ago
Alpha-Innovator / InternAgent
When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification
☆478Updated this week
Alpha-Innovator / DocGenome
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
☆142Updated 6 months ago
gersteinlab / ML-Bench
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…
☆302Updated this week
codefuse-ai / CodeFuse-CGM
☆359Updated last month
PKU-YuanGroup / Machine-Mindset
An MBTI Exploration of Large Language Models
☆492Updated last year
HITsz-TMG / UMOE-Scaling-Unified-Multimodal-LLMs
The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
☆745Updated 2 months ago
URSA-MATH / URSA-MATH
☆63Updated 4 months ago
Zefan-Cai / R-KV
R-KV: Redundancy-aware KV Cache Compression for Reasoning Models
☆1,097Updated 3 weeks ago
Tencent-Hunyuan / ArtifactsBenchmark
☆210Updated last week
mragbench / MRAG-Bench
[ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
☆52Updated 6 months ago
rllm-team / rllm
Pytorch Library for Relational Table Learning with LLMs.
☆432Updated 3 weeks ago
EvoAgentX / EvoAgentX
🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
☆1,045Updated this week
pat-jj / s3
s3 - Efficient Yet Effective Search Agent Training via RL for RAG
☆480Updated this week
uclaml / SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
☆570Updated 6 months ago
Zefan-Cai / KVCache-Factory
Unified KV Cache Compression Methods for Auto-Regressive Models
☆1,216Updated 6 months ago