HKUDS / VideoRAG
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
☆708 Updated last month
Alternatives and similar repositories for VideoRAG
Users who are interested in VideoRAG are comparing it to the libraries listed below.
- "MiniRAG: Making RAG Simpler with Small and Free Language Models" ☆1,168 Updated 2 weeks ago
- "Your Fully-Automated Personal AI Assistant, and Open-Source & Cost-Efficient Alternative to OpenAI's Deep Research" ☆1,007 Updated 2 months ago
- Build multimodal language agents for fast prototype and production ☆2,506 Updated 3 months ago
- "GraphAgent: Agentic Graph Language Assistant" ☆306 Updated 4 months ago
- ✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model ☆588 Updated 3 weeks ago
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges ☆860 Updated 2 weeks ago
- 🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents ☆931 Updated this week
- Awesome-GraphRAG: A curated list of resources (surveys, papers, benchmarks, and open-source projects) on graph-based retrieval-augmented g… ☆1,189 Updated last week
- DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning ☆561 Updated 3 weeks ago
- Skywork-R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning ☆2,626 Updated last week
- Train your Agent model via our easy and efficient framework ☆1,144 Updated this week
- (ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat… ☆234 Updated 3 weeks ago
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents ☆496 Updated last week
- In-depth study of GraphRAG ☆1,345 Updated last month
- Ola: Pushing the Frontiers of Omni-Modal Language Model ☆341 Updated last week
- Parsing-free RAG supported by VLMs ☆741 Updated 4 months ago
- LAYRA, an enterprise-ready, out-of-the-box solution, unlocks next-generation intelligent systems powered by visual RAG and limitless visual… ☆732 Updated this week
- Align Anything: Training All-modality Model with Feedback ☆4,026 Updated 3 weeks ago
- Medical o1, Towards medical complex reasoning with LLMs ☆1,137 Updated 5 months ago
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents ☆367 Updated 4 months ago
- Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the maki… ☆979 Updated 3 months ago
- This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension" ☆199 Updated 4 months ago
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions ☆1,063 Updated 8 months ago
- A lightweight LMM-based Document Parsing Model ☆2,473 Updated this week
- Recursive RAG with R1 reasoning ☆319 Updated last month
- Convert files (PDF, image, Word, PPT, Excel, notebooks, code snippets) to markdown using powerful multimodal LLM ☆259 Updated last month
- FlexRAG: A RAG Framework for Information Retrieval and Generation. ☆172 Updated this week
- ☆906 Updated this week
- AI Manus is a general-purpose AI Agent system that supports running various tools and operations in a sandbox environment. ☆735 Updated this week
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models ☆913 Updated 3 months ago