Alibaba-NLP / OmniSearchLinks
Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
☆329Updated last month
Alternatives and similar repositories for OmniSearch
Users that are interested in OmniSearch are comparing it to the libraries listed below
Sorting:
- MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval☆181Updated last week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆409Updated last month
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆541Updated last week
- Collect every awesome work about r1!☆372Updated last month
- Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥☆257Updated 4 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆518Updated last week
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆479Updated 2 months ago
- Agentic RAG R1 Framework via Reinforcement Learning☆187Updated last week
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆178Updated this week
- R1-onevision, a visual language model capable of deep CoT reasoning.☆524Updated last month
- ☆193Updated last week
- 🌐 WebWalker [ACL2025] & WebDancer [Preprint]☆421Updated this week
- A Survey on Multimodal Retrieval-Augmented Generation☆206Updated this week
- Parsing-free RAG supported by VLMs☆722Updated 3 months ago
- ☆221Updated last year
- ☆140Updated 4 months ago
- ☆208Updated last week
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆169Updated last month
- Awesome Agent Training☆131Updated this week
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆243Updated 7 months ago
- ☆706Updated this week
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆274Updated last year
- Explore the Multimodal “Aha Moment” on 2B Model☆589Updated 2 months ago
- Train your Agent model via our easy and efficient framework☆776Updated this week
- Latest Advances on Long Chain-of-Thought Reasoning☆343Updated this week
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆223Updated last month
- This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-sta…☆579Updated 3 weeks ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆254Updated 2 weeks ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆111Updated 2 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆142Updated 2 months ago