WideSearch: Benchmarking Agentic Broad Info-Seeking
☆120Oct 9, 2025Updated 4 months ago
Alternatives and similar repositories for WideSearch
Users that are interested in WideSearch are comparing it to the libraries listed below
Sorting:
- ☆15Jan 23, 2025Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆39Sep 12, 2024Updated last year
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Jan 8, 2026Updated last month
- DELT: Data Efficacy for Language Model Training☆43Feb 12, 2026Updated 2 weeks ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆102Feb 20, 2025Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆50Sep 4, 2025Updated 5 months ago
- ☆19Jul 21, 2025Updated 7 months ago
- AI for Mathematics Paper List☆17Jan 14, 2025Updated last year
- PAHF Personalized Agent from Human Feedback☆31Feb 17, 2026Updated last week
- [ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations☆13Sep 11, 2024Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆592Feb 16, 2026Updated last week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆702Oct 15, 2025Updated 4 months ago
- ☆83Apr 3, 2025Updated 10 months ago
- ☆17May 31, 2023Updated 2 years ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated last year
- ☆39Jul 25, 2024Updated last year
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆184Dec 11, 2025Updated 2 months ago
- ☆123Jun 6, 2024Updated last year
- ☆84Apr 18, 2024Updated last year
- ☆16Jul 12, 2024Updated last year
- A Deep Research replica built with LangChain and LangGraph.☆15Apr 11, 2025Updated 10 months ago
- ☆17Mar 3, 2025Updated 11 months ago
- ☆17Jul 12, 2025Updated 7 months ago
- The code of CIKM 2023 (Oral Presentation) : A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NE…☆14Jul 19, 2024Updated last year
- ☆14Aug 15, 2024Updated last year
- ☆30Dec 27, 2024Updated last year
- 大模型智能体Agent中文教程,博客代码仓库☆59Nov 5, 2025Updated 3 months ago
- ☆30Feb 16, 2024Updated 2 years ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆139Jun 12, 2024Updated last year
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 3 months ago
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 9 months ago
- ☆36Sep 6, 2024Updated last year
- ☆18Apr 18, 2025Updated 10 months ago
- ☆15Jun 20, 2024Updated last year