Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group.
☆934Apr 29, 2026Updated 3 weeks ago
Alternatives and similar repositories for VRAG
Users that are interested in VRAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆660Jan 11, 2026Updated 4 months ago
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆37May 9, 2026Updated 2 weeks ago
- Parsing-free RAG supported by VLMs☆956Dec 7, 2025Updated 5 months ago
- [ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal…☆446Apr 7, 2026Updated last month
- ☆47Apr 9, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Survey on Multimodal Retrieval-Augmented Generation☆513Feb 20, 2026Updated 3 months ago
- Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"☆175Mar 18, 2026Updated 2 months ago
- [ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex…☆23Aug 28, 2025Updated 8 months ago
- ☆70May 19, 2025Updated last year
- ☆1,215Nov 20, 2025Updated 6 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆424Apr 22, 2025Updated last year
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆4,950Apr 6, 2026Updated last month
- ☆38Apr 1, 2026Updated last month
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.☆389Jun 1, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,753Nov 13, 2025Updated 6 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆155May 27, 2025Updated 11 months ago
- The official repository of NodeRAG☆414Mar 19, 2025Updated last year
- Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization.☆81Oct 31, 2025Updated 6 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆95Aug 8, 2025Updated 9 months ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- ☆40Apr 6, 2026Updated last month
- Agentic RAG R1 Framework via Reinforcement Learning☆413Feb 16, 2026Updated 3 months ago
- ☆533Updated this week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆239Nov 7, 2025Updated 6 months ago
- Tongyi Deep Research, the Leading Open-source Deep Research Agent☆18,892Feb 27, 2026Updated 2 months ago
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"☆10Nov 2, 2024Updated last year
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆712Aug 5, 2025Updated 9 months ago
- the official repo for E^ 2GraphRAG: Streamlining Graph-based RAG for High Efficiency and Effectiveness☆159Mar 18, 2026Updated 2 months ago
- EMNLP MAIN 2025 StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization☆70Sep 13, 2025Updated 8 months ago
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 8 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆1,044May 12, 2026Updated last week
- ☆191Apr 14, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆67Jan 4, 2026Updated 4 months ago
- MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka☆324Jun 21, 2025Updated 11 months ago
- A holistic framework for advancing LLMs as data science agents☆49Updated this week
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…☆14,218Updated this week
- Solve Visual Understanding with Reinforced VLMs☆5,959Mar 12, 2026Updated 2 months ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,637Updated this week
- ☆63Jan 3, 2025Updated last year