HKUDS / VideoRAG
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
☆678 Updated 3 weeks ago
Alternatives and similar repositories for VideoRAG
Users who are interested in VideoRAG are comparing it to the libraries listed below.
- Build multimodal language agents for fast prototype and production ☆2,491 Updated 2 months ago
- "Your Fully-Automated Personal AI Assistant, and Open-Source & Cost-Efficient Alternative to OpenAI's Deep Research" ☆984 Updated 2 months ago
- "MiniRAG: Making RAG Simpler with Small and Free Language Models" ☆1,124 Updated 3 weeks ago
- "GraphAgent: Agentic Graph Language Assistant" ☆303 Updated 3 months ago
- Skywork-R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning ☆2,601 Updated this week
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents ☆362 Updated 3 months ago
- LAYRA is a ready-to-use visual RAG system with a complete web UI built with Next.js and FastAPI, preserving document layout, tables, para… ☆633 Updated last month
- ☆884 Updated 2 months ago
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges ☆716 Updated 2 weeks ago
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents ☆479 Updated 2 months ago
- ✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model ☆487 Updated last week
- 🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents ☆553 Updated this week
- In-depth study of the graphrag ☆1,328 Updated 3 weeks ago
- Train your Agent model via our easy and efficient framework ☆776 Updated this week
- Awesome-GraphRAG: A curated list of resources (surveys, papers, benchmarks, and opensource projects) on graph-based retrieval-augmented g… ☆1,131 Updated 2 weeks ago
- Parsing-free RAG supported by VLMs ☆716 Updated 3 months ago
- free and open OpenAI Deep Research ☆565 Updated 3 months ago
- Ola: Pushing the Frontiers of Omni-Modal Language Model ☆337 Updated 3 months ago
- Align Anything: Training All-modality Model with Feedback ☆3,814 Updated this week
- DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning ☆521 Updated this week
- 🌐 WebWalker [ACL2025] & WebDancer [Preprint] ☆421 Updated this week
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions ☆1,057 Updated 7 months ago
- An LLM-based Agent that predicts its tasks proactively. ☆367 Updated last week
- An Innovative Agent Framework Driven by KG Engine ☆763 Updated 4 months ago
- Medical o1, Towards medical complex reasoning with LLMs ☆1,116 Updated 4 months ago
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking ☆450 Updated last month
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities. ☆1,754 Updated 4 months ago
- Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the maki… ☆964 Updated 2 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent ☆329 Updated last month
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models ☆911 Updated 2 months ago