texttron/BrowseComp-Plus

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/texttron/BrowseComp-Plus)

texttron / BrowseComp-Plus

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent (ACL 2026 Main)

☆318

Alternatives and similar repositories for BrowseComp-Plus

Users that are interested in BrowseComp-Plus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Hannibal046 / GPT-OSS-BrowseCompPlus-Eval
View on GitHub
Evaluating GPT-OSS on BrowseComp-Plus with Native Browsering Tools
☆20Oct 17, 2025Updated 9 months ago
hkust-nlp / WebExplorer
View on GitHub
The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
☆120Sep 29, 2025Updated 9 months ago
inclusionAI / ASearcher
View on GitHub
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
☆602Nov 26, 2025Updated 7 months ago
texttron / AgentIR
View on GitHub
AgentIR is a retriever specialized for Deep Research agents.
☆62Apr 16, 2026Updated 3 months ago
xlang-ai / BRIGHT
View on GitHub
[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
☆206Sep 13, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
texttron / RISE
View on GitHub
Retrieving Interaction SpacE for Agentic Search
☆26Jun 8, 2026Updated last month
PALIN2018 / BrowseComp-ZH
View on GitHub
☆158May 14, 2025Updated last year
RedSearchAgent / DeepTraceHub
View on GitHub
RedSearcher's framework for deep search agent trajectory synthesis, QA filtering, and model evaluation, supporting ReACT and DeepSeek-sty…
☆23Feb 26, 2026Updated 4 months ago
GAIR-NLP / DeepResearcher
View on GitHub
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆782May 10, 2026Updated 2 months ago
PeterGriffinJin / Search-R1
View on GitHub
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,130Nov 13, 2025Updated 8 months ago
Ayanami0730 / deep_research_bench
View on GitHub
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
☆793May 11, 2026Updated 2 months ago
RedSearchAgent / REDSearcher
View on GitHub
REDSearch: A scalable, cost-efficient framework for long-horizon search agents. Features complex task synthesis, optimized mid-training, …
☆129Feb 26, 2026Updated 4 months ago
TIGER-AI-Lab / OpenResearcher
View on GitHub
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
☆1,092Jun 10, 2026Updated last month
microsoft / InfoAgent
View on GitHub
☆70Feb 6, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
THUDM / DeepDive
View on GitHub
DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
☆333Jun 17, 2026Updated last month
VectorSpaceLab / Infomatica
View on GitHub
Data Synthesis for Deep Research Based on Semi-Structured Data
☆213Jul 14, 2026Updated last week
ByteDance-Seed / WideSearch
View on GitHub
WideSearch: Benchmarking Agentic Broad Info-Seeking
☆148Oct 9, 2025Updated 9 months ago
VectorSpaceLab / agentic-search
View on GitHub
Advancing search on top of AI agents
☆31Jun 9, 2026Updated last month
prnake / kimi-deepresearch
View on GitHub
Kimi K2 Thinking Agentic Search Unofficial Implementation
☆15Nov 9, 2025Updated 8 months ago
sierra-research / tau-bench
View on GitHub
Code and Data for Tau-Bench
☆1,337Mar 18, 2026Updated 4 months ago
sjtu-sai-agents / Browse-Master
View on GitHub
Official implementation of Browse-Master, a tool-augmented web-search agent.
☆35Aug 22, 2025Updated 10 months ago
GasolSun36 / PyRAG
View on GitHub
Retrieval is CheapShow Me the Code: Executable Multi-Hop Reasoning for Retrieval-Augmented Generation
☆26May 14, 2026Updated 2 months ago
OPPO-PersonalAI / FINDER_DEFT
View on GitHub
Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"
☆65Dec 10, 2025Updated 7 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
HansiZeng / scaling-retriever
View on GitHub
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22Mar 31, 2025Updated last year
TsinghuaC3I / SSRL
View on GitHub
SSRL: Self-Search Reinforcement Learning
☆210Aug 20, 2025Updated 11 months ago
AIR-Bench / AIR-Bench
View on GitHub
[ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
☆167Mar 29, 2026Updated 3 months ago
THUDM / slime
View on GitHub
slime is an LLM post-training framework for RL Scaling.
☆7,569Updated this week
sunnweiwei / FoldAgent
View on GitHub
[ICML'26] Scaling Long-Horizon LLM Agent via Context-Folding
☆178May 18, 2026Updated 2 months ago
RUC-NLPIR / iAgent
View on GitHub
Including 12+ cutting-edge agent systems across multiple research directions
☆35Nov 10, 2025Updated 8 months ago
Agent-RL / ReCall
View on GitHub
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…
☆1,410May 16, 2025Updated last year
texttron / tevatron
View on GitHub
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742Updated this week
cxcscmu / deepresearch_benchmarking
View on GitHub
☆29Mar 10, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / ReasonIR
View on GitHub
Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".
☆230Jul 2, 2026Updated 2 weeks ago
HansiZeng / PAG
View on GitHub
[SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …
☆32Apr 24, 2024Updated 2 years ago
TIGER-AI-Lab / verl-tool
View on GitHub
A version of verl to support diverse tool use [TMLR 2026]
☆1,021Updated this week
sierra-research / tau2-bench
View on GitHub
τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
☆1,631Updated this week
langfengQ / verl-agent
View on GitHub
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆2,140Jun 9, 2026Updated last month
RulinShao / retrieval-scaling
View on GitHub
Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".
☆226Dec 16, 2025Updated 7 months ago
SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year