Alibaba-NLP / OmniSearchLinks
Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
☆384Updated 6 months ago
Alternatives and similar repositories for OmniSearch
Users that are interested in OmniSearch are comparing it to the libraries listed below
Sorting:
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆377Updated last week
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆228Updated 5 months ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆584Updated 4 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆630Updated last week
- ☆232Updated last year
- Collect every awesome work about r1!☆420Updated 5 months ago
- A live reading list for LLM data synthesis (Updated to July, 2025).☆387Updated 2 months ago
- Agentic RAG R1 Framework via Reinforcement Learning☆307Updated last month
- Parsing-free RAG supported by VLMs☆832Updated this week
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆649Updated 2 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆293Updated 3 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆237Updated 2 months ago
- ☆748Updated last month
- R1-onevision, a visual language model capable of deep CoT reasoning.☆569Updated 6 months ago
- ☆249Updated last year
- A Survey on Multimodal Retrieval-Augmented Generation☆394Updated last week
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆223Updated 4 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆247Updated 6 months ago
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆218Updated 4 months ago
- ☆365Updated last week
- a toolkit on knowledge distillation for large language models☆181Updated last week
- ☆265Updated 10 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆291Updated 5 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆160Updated 7 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆840Updated 3 months ago
- CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models☆337Updated 5 months ago
- Dingo: A Comprehensive AI Data Quality Evaluation Tool☆505Updated this week
- ☆200Updated 6 months ago
- a-m-team's exploration in large language modeling☆189Updated 4 months ago
- MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, Brow…☆768Updated last week