Alibaba-NLP / OmniSearchLinks
Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
☆358Updated 4 months ago
Alternatives and similar repositories for OmniSearch
Users that are interested in OmniSearch are comparing it to the libraries listed below
Sorting:
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆315Updated last month
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆219Updated 3 months ago
- Collect every awesome work about r1!☆412Updated 3 months ago
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆316Updated this week
- ☆232Updated last year
- Parsing-free RAG supported by VLMs☆770Updated 6 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆558Updated 4 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面 壁小钢炮” focuses on achi…☆275Updated last month
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆623Updated 2 weeks ago
- A live reading list for LLM data synthesis (Updated to July, 2025).☆360Updated this week
- Agentic RAG R1 Framework via Reinforcement Learning☆281Updated 3 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆303Updated last week
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆528Updated 2 months ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆557Updated 4 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆757Updated last month
- a-m-team's exploration in large language modeling☆183Updated 2 months ago
- ☆737Updated 2 months ago
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆213Updated 2 months ago
- Dingo: A Comprehensive AI Data Quality Evaluation Tool☆352Updated 2 weeks ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆227Updated 4 months ago
- A Survey on Multimodal Retrieval-Augmented Generation☆310Updated this week
- ☆317Updated 2 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆275Updated 3 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆286Updated last year
- ☆259Updated 8 months ago
- ☆365Updated 6 months ago
- A LLM-based Agent that predict its tasks proactively.☆410Updated 3 months ago
- ☆231Updated last year
- ☆54Updated 11 months ago
- Miroflow is an agent framework that simplifies the development of complex, multi-agent systems. Build, manage, and scale your AI agents w…☆332Updated this week