Alibaba-NLP / OmniSearchView external linksLinks
Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
☆412Apr 22, 2025Updated 9 months ago
Alternatives and similar repositories for OmniSearch
Users that are interested in OmniSearch are comparing it to the libraries listed below
Sorting:
- Parsing-free RAG supported by VLMs☆910Dec 7, 2025Updated 2 months ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆130Nov 6, 2024Updated last year
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆91Nov 15, 2024Updated last year
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,164Nov 17, 2025Updated 2 months ago
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆441Jan 13, 2026Updated last month
- ☆189Feb 5, 2026Updated last week
- ☆16Jul 8, 2024Updated last year
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆293Aug 4, 2025Updated 6 months ago
- [ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs☆488Jan 23, 2025Updated last year
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models☆59Jan 22, 2025Updated last year
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching☆1,240Aug 16, 2025Updated 5 months ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)☆3,309Nov 26, 2025Updated 2 months ago
- ☆164Jan 21, 2025Updated last year
- ☆51May 11, 2025Updated 9 months ago
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆233Nov 7, 2025Updated 3 months ago
- ☆483Sep 25, 2024Updated last year
- An Open Large Reasoning Model for Real-World Solutions☆1,533Feb 3, 2026Updated last week
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆25May 30, 2024Updated last year
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆233Jan 21, 2026Updated 3 weeks ago
- This is the official repository for Retrieval Augmented Visual Question Answering☆244Dec 19, 2024Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,317May 16, 2025Updated 8 months ago
- ☆46Dec 30, 2024Updated last year
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆3,975Nov 13, 2025Updated 3 months ago
- Ola: Pushing the Frontiers of Omni-Modal Language Model☆386Jun 13, 2025Updated 8 months ago
- [EMNLP2025] "GraphAgent: Agentic Graph Language Assistant"☆338Feb 8, 2025Updated last year
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆9,792Sep 22, 2025Updated 4 months ago
- Retrieval and Retrieval-augmented LLMs☆11,280Dec 15, 2025Updated last month
- [ICCV 2025] Explore the Limits of Omni-modal Pretraining at Scale☆123Sep 2, 2024Updated last year
- Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up…☆26Dec 9, 2024Updated last year
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆198Aug 16, 2024Updated last year
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆177Oct 1, 2024Updated last year
- ☆237Apr 23, 2024Updated last year
- Solve Visual Understanding with Reinforced VLMs☆5,833Oct 21, 2025Updated 3 months ago
- 支持中英文双语视觉-文本对话的开源可商用多模态模型。☆378Sep 23, 2023Updated 2 years ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆148May 27, 2025Updated 8 months ago
- InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions☆2,921May 26, 2025Updated 8 months ago
- O1 Replication Journey☆2,000Jan 14, 2025Updated last year
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊☆272Jan 27, 2025Updated last year