Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
☆424Apr 22, 2025Updated last year
Alternatives and similar repositories for OmniSearch
Users that are interested in OmniSearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Parsing-free RAG supported by VLMs☆956Dec 7, 2025Updated 5 months ago
- Enjoy easier conversations with LLM☆46Mar 13, 2025Updated last year
- ☆15Jul 8, 2024Updated last year
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆92Nov 15, 2024Updated last year
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆26May 30, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆128Nov 6, 2024Updated last year
- ☆52May 11, 2025Updated last year
- ☆190Mar 13, 2026Updated 2 months ago
- [ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs☆494Apr 5, 2026Updated last month
- Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up…☆26Dec 9, 2024Updated last year
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,221Nov 17, 2025Updated 6 months ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching☆1,273Aug 16, 2025Updated 9 months ago
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models☆62Jan 22, 2025Updated last year
- [ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal…☆446Apr 7, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This is the official repository for Retrieval Augmented Visual Question Answering☆250Dec 19, 2024Updated last year
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆299Aug 4, 2025Updated 9 months ago
- ☆491Sep 25, 2024Updated last year
- Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group.☆934Apr 29, 2026Updated last month
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)☆3,490Apr 10, 2026Updated last month
- ☆48Dec 30, 2024Updated last year
- An Open Large Reasoning Model for Real-World Solutions☆1,543Feb 13, 2026Updated 3 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,753Nov 13, 2025Updated 6 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆712Aug 5, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Retrieval and Retrieval-augmented LLMs☆11,722Apr 22, 2026Updated last month
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆180Oct 1, 2024Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,385May 16, 2025Updated last year
- ☆166Jan 21, 2025Updated last year
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆239Nov 7, 2025Updated 6 months ago
- ☆77Oct 27, 2023Updated 2 years ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆660Jan 11, 2026Updated 4 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆10,038Sep 22, 2025Updated 8 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆153May 27, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks☆4,145May 15, 2026Updated 2 weeks ago
- [SIGIR '26] Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation☆41May 15, 2026Updated 2 weeks ago
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…☆14,218May 22, 2026Updated last week
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,406May 30, 2025Updated 11 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆211Aug 16, 2024Updated last year
- [EMNLP2025] "GraphAgent: Agentic Graph Language Assistant"☆365Feb 8, 2025Updated last year
- Solve Visual Understanding with Reinforced VLMs☆5,959Mar 12, 2026Updated 2 months ago