[ICCV 2025] StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition
☆63Jun 25, 2025Updated 8 months ago
Alternatives and similar repositories for StreamMind
Users that are interested in StreamMind are comparing it to the libraries listed below
Sorting:
- VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)☆646Nov 26, 2025Updated 3 months ago
- [CVPR 2025]Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction☆168Mar 23, 2025Updated 11 months ago
- [CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant☆27Dec 2, 2025Updated 3 months ago
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?☆125Jul 24, 2025Updated 7 months ago
- ☆18Aug 7, 2025Updated 7 months ago
- Official implementation of paper VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interact…☆43Feb 5, 2025Updated last year
- [CVPR 2025] Online Video Understanding: OVBench and VideoChat-Online☆91Oct 7, 2025Updated 5 months ago
- LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)☆430Oct 29, 2025Updated 4 months ago
- This is the official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"☆274Oct 15, 2025Updated 5 months ago
- StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding☆150May 16, 2025Updated 10 months ago
- ☆49Feb 25, 2026Updated 3 weeks ago
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated 9 months ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding"☆60Updated this week
- [CVPR 2025] EgoLife: Towards Egocentric Life Assistant☆405Mar 19, 2025Updated last year
- [ICLR 2024] DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning against State Observations Perturbations.☆17May 24, 2024Updated last year
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆29Apr 16, 2024Updated last year
- [ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models