OPPO-PersonalAI/FINDER_DEFT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OPPO-PersonalAI/FINDER_DEFT)

OPPO-PersonalAI / FINDER_DEFT

Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"

☆66

Alternatives and similar repositories for FINDER_DEFT

Users that are interested in FINDER_DEFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cxcscmu / deepresearch_benchmarking
View on GitHub
☆29Mar 10, 2026Updated 4 months ago
NJU-LINK / DR3-Eval
View on GitHub
☆39May 7, 2026Updated 2 months ago
Rainier-rq / verl-if
View on GitHub
Official implementation of the paper "Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following"
☆40Jan 11, 2026Updated 6 months ago
OPPO-PersonalAI / Flash-Searcher
View on GitHub
Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution
☆88Dec 8, 2025Updated 7 months ago
hkust-nlp / WebExplorer
View on GitHub
The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
☆120Sep 29, 2025Updated 9 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
liushulinle / MarsRL
View on GitHub
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
☆18Nov 18, 2025Updated 8 months ago
multimodal-art-projection / CodeCriticBench
View on GitHub
☆16Nov 1, 2025Updated 8 months ago
MobileLLM / ParaThinker
View on GitHub
☆48Nov 1, 2025Updated 8 months ago
FractalAIResearchLabs / Fathom-DeepResearch
View on GitHub
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval And Synthesis For SLMs
☆62Oct 7, 2025Updated 9 months ago
Quehry / HelloBench
View on GitHub
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
☆60Nov 26, 2024Updated last year
texttron / AgentIR
View on GitHub
AgentIR is a retriever specialized for Deep Research agents.
☆62Apr 16, 2026Updated 3 months ago
open-compass / CompassVerifier
View on GitHub
[EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
☆68Aug 10, 2025Updated 11 months ago
stepfun-ai / StepDeepResearch
View on GitHub
Step-DeepResearch
☆570Mar 24, 2026Updated 4 months ago
Fu-Dayuan / AgentRefine
View on GitHub
(ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning
☆20Nov 22, 2025Updated 8 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
scaleapi / researchrubrics
View on GitHub
Code repository for ICLR 2026 paper "ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents" (https://ww…
☆28Feb 10, 2026Updated 5 months ago
RUCAIBox / R1-Searcher-plus
View on GitHub
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
☆82May 25, 2025Updated last year
open-compass / ProSA
View on GitHub
[EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
☆29May 22, 2025Updated last year
psinger / kaggle-curriculum-solution
View on GitHub
☆17Mar 24, 2023Updated 3 years ago
OPPO-PersonalAI / O-Mem
View on GitHub
☆71Dec 11, 2025Updated 7 months ago
NJU-LINK / IF-VidCap
View on GitHub
The Source Code for IF-VidCap @ICLR 2026
☆19Oct 22, 2025Updated 9 months ago
RUC-NLPIR / iAgent
View on GitHub
Including 12+ cutting-edge agent systems across multiple research directions
☆35Nov 10, 2025Updated 8 months ago
OPPO-PersonalAI / OAgents
View on GitHub
Implementation for OAgents: An Empirical Study of Building Effective Agents
☆327Oct 13, 2025Updated 9 months ago
rlresearch / dr-tulu
View on GitHub
Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
☆690Jun 17, 2026Updated last month
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
petergpt / lean-3d-game-axiom
View on GitHub
A playable 3D voxel game built in Lean 4.
☆21Jul 12, 2026Updated last week
Ayanami0730 / deep_research_bench
View on GitHub
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
☆797May 11, 2026Updated 2 months ago
open-compass / Creation-MMBench
View on GitHub
Assessing Context-Aware Creative Intelligence in MLLMs
☆23Jul 22, 2025Updated last year
NJU-LINK / WebCompass
View on GitHub
The Source Code for WebCompass
☆21May 2, 2026Updated 2 months ago
multimodal-art-projection / COIG-P
View on GitHub
☆42Jul 15, 2025Updated last year
mangopy / Deep-Research-Survey
View on GitHub
A Systematic Survey of Deep Research
☆318Jan 1, 2026Updated 6 months ago
howard-yen / SLIM
View on GitHub
☆27Jun 22, 2026Updated last month
OscarXZQ / delta_activations
View on GitHub
Official code release for Delta Activations: A Representation for Finetuned Large Language Models
☆20Sep 5, 2025Updated 10 months ago
texttron / BrowseComp-Plus
View on GitHub
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent (ACL 2026 Main)
☆319May 28, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
open-compass / GPassK
View on GitHub
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆33Aug 5, 2025Updated 11 months ago
kangreen0210 / LIME
View on GitHub
Accelerating the development of large multimodal models (LMMs) with lmms-eval
☆14Oct 14, 2024Updated last year
youdotcom-oss / ydc-deep-research-evals
View on GitHub
you.com's framework for evaluating deep research systems.
☆75May 15, 2025Updated last year
SparksJoe / Prism
View on GitHub
A Framework for Decoupling and Assessing the Capabilities of VLMs
☆44Jun 28, 2024Updated 2 years ago
SharkSpicy-NLP / SR-KI
View on GitHub
SR-KI: Scalable and Real-Time Knowledge Integration into LLMs via Supervised Attention
☆56Dec 6, 2025Updated 7 months ago
JingMog / THOR
View on GitHub
[ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".
☆33Feb 26, 2026Updated 4 months ago
YuyaoZhangQAQ / QCompiler
View on GitHub
This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.
☆17Oct 20, 2025Updated 9 months ago