ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry
☆47Jan 5, 2026Updated last month
Alternatives and similar repositories for ResearcherBench
Users who are interested in ResearcherBench are comparing it to the repositories listed below.
- Project code for the final assignment of the graduate course "Theory and Application of Network Big Data Management"☆13Dec 31, 2022Updated 3 years ago
- [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge☆100Updated this week
- ☆40Dec 16, 2025Updated 2 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆38Jun 23, 2025Updated 8 months ago
- [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"☆41May 22, 2025Updated 9 months ago
- ☆41May 22, 2025Updated 9 months ago
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last month
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆597Updated this week
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 4 months ago
- ☆12Sep 21, 2023Updated 2 years ago
- you.com's framework for evaluating deep research systems.☆69May 15, 2025Updated 9 months ago
- Tutorial about noisy labels for SIBGRAPI 2020☆11Nov 6, 2020Updated 5 years ago
- Code for an agentic RAG method with a dynamic workflow.☆12Jan 22, 2026Updated last month
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- A distributed stream querying engine that provides sub-millisecond stateful query at millions of queries per-second over fast-evolving li…☆10Jul 18, 2018Updated 7 years ago
- vscode-translation, a translation plugin☆10Mar 3, 2022Updated 4 years ago
- LaTeX Beamer template crafted for University of Illinois Chicago☆11Dec 7, 2024Updated last year
- Android fast builds: from 10 minutes to 10 seconds☆24Feb 4, 2026Updated 3 weeks ago
- A native macOS OCR module for Node.js☆19Oct 11, 2025Updated 4 months ago
- A curated collection of my agent-skills☆25Jan 25, 2026Updated last month
- A simple operating system☆10Sep 9, 2022Updated 3 years ago
- Practice typing in your favorite programming language☆12Apr 27, 2014Updated 11 years ago
- Auth client for Yuque OAuth apps☆10Jul 13, 2023Updated 2 years ago
- A LaTeX template for typesetting short course papers.☆10Oct 21, 2019Updated 6 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using PyTorch and Hugging Face Trainers.☆11Apr 5, 2023Updated 2 years ago
- A Flutter clone of the QQ app's UI and interactions☆12Dec 27, 2023Updated 2 years ago
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆20Jul 31, 2025Updated 7 months ago
- A simple API for using CUPTI☆11Aug 19, 2025Updated 6 months ago
- ☆18May 3, 2025Updated 10 months ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated 9 months ago
- ☆12Nov 5, 2024Updated last year
- My notes for reading leveldb☆11Apr 19, 2024Updated last year
- ☆26Jul 29, 2025Updated 7 months ago
- ☆14Oct 21, 2024Updated last year
- ☆11Sep 12, 2023Updated 2 years ago
- [ACL 2025] NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering☆22Jul 29, 2025Updated 7 months ago
- ☆13Mar 3, 2024Updated 2 years ago
- xState-based validation tool for OCF files☆15Apr 10, 2025Updated 10 months ago
- Paper and code for "New Directions in Cloud Programming", CIDR 2021☆11Feb 17, 2021Updated 5 years ago