[KDD'2026] "VideoRAG: Chat with Your Videos"
☆2,742 · Jan 11, 2026 · Updated last month
Alternatives and similar repositories for VideoRAG
Users who are interested in VideoRAG are comparing it to the libraries listed below.
- "MiniRAG: Making RAG Simpler with Small and Open-Sourced Language Models" ☆1,733 · Oct 16, 2025 · Updated 4 months ago
- [EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation" ☆28,932 · Updated this week
- ✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehensi… ☆402 · Jan 14, 2026 · Updated last month
- [EMNLP-2024] Build multimodal language agents for fast prototype and production ☆2,631 · Mar 19, 2025 · Updated 11 months ago
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework" ☆8,616 · Oct 16, 2025 · Updated 4 months ago
- Align Anything: Training All-modality Model with Feedback ☆4,635 · Nov 27, 2025 · Updated 3 months ago
- [WSDM'2025] "MixRec: Heterogeneous Graph Collaborative Filtering" ☆19 · Dec 19, 2024 · Updated last year
- 🧠 VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning (ICLR 2026) ☆305 · Feb 8, 2026 · Updated 3 weeks ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents ☆632 · Jan 11, 2026 · Updated last month
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning. ☆3,150 · Dec 15, 2025 · Updated 2 months ago
- UFO³: Weaving the Digital Agent Galaxy ☆8,062 · Updated this week
- Frontier Multimodal Foundation Models for Image and Video Understanding ☆1,109 · Aug 14, 2025 · Updated 6 months ago
- Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mo… ☆7,741 · Updated this week
- "RAG-Anything: All-in-One RAG Framework" ☆13,867 · Updated this week
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI… ☆11,126 · Updated this week
- The open source platform for AI-native application development. ☆5,367 · Dec 2, 2024 · Updated last year
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation ☆8,645 · Sep 14, 2024 · Updated last year
- 💰The only genuine version💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy mining pool fee skimming, mining pool proxy, mining pool relay, mining pool… ☆3,882 · Updated this week
- "Your Fully-Automated Personal AI Assistant" ☆1,383 · Oct 16, 2025 · Updated 4 months ago
- PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation ☆2,372 · Sep 10, 2025 · Updated 5 months ago
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a… ☆3,248 · Sep 4, 2025 · Updated 6 months ago
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone ☆23,942 · Feb 23, 2026 · Updated last week
- [ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation ☆3,683 · Feb 27, 2025 · Updated last year
- "ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)" ☆2,368 · Updated this week
- The next generation deep reinforcement learning toolkit ☆3,462 · Jun 16, 2023 · Updated 2 years ago
- [CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer ☆12,484 · Oct 11, 2025 · Updated 4 months ago
- 🔥🔥First-ever hour scale video understanding models ☆611 · Jul 14, 2025 · Updated 7 months ago
- The first open autoregressive foundational video AI model. ☆2,891 · Oct 14, 2024 · Updated last year
- Build Real-Time Knowledge Graphs for AI Agents ☆23,192 · Updated this week
- [EMNLP2025] "GraphAgent: Agentic Graph Language Assistant" ☆341 · Feb 8, 2025 · Updated last year
- TVM Documentation in Chinese Simplified / TVM 中文文档 ☆3,408 · Nov 21, 2025 · Updated 3 months ago
- Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale ☆5,651 · Updated this week
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆18,386 · Jan 30, 2026 · Updated last month
- OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient. ☆9,391 · Dec 4, 2025 · Updated 3 months ago
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming ☆64,648 · Jan 21, 2026 · Updated last month
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a… ☆8,574 · Jan 28, 2026 · Updated last month
- A Doctor for your data ☆3,489 · Jan 14, 2025 · Updated last year
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆31,162 · Updated this week
- "VideoAgent: All-in-One Agentic Framework for Video Understanding, Editing, and Remaking" ☆460 · Oct 17, 2025 · Updated 4 months ago