[Awesome] 🔥🔥🔥 Latest Papers, Codes and Datasets on Streaming / Online Video Understanding
☆120Jan 13, 2026Updated last month
Alternatives and similar repositories for Awesome-Streaming-Video-Understanding
Users who are interested in Awesome-Streaming-Video-Understanding are comparing it to the repositories listed below
- [EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models☆64Updated this week
- Interactively browse multimodal tabular data☆104Feb 11, 2026Updated 3 weeks ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding"☆59Jan 23, 2026Updated last month
- A repository for evaluating large language models as raters in large-scale writing assessments, focusing on a psychometric framework for …☆82Jan 26, 2025Updated last year
- [CVPR 2026] Accelerating Streaming Video Large Language Models via Hierarchical Token Compression☆45Feb 25, 2026Updated last week
- A collection of wrapped Golang utility libraries☆42Feb 4, 2026Updated last month
- 🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.☆153Jan 16, 2026Updated last month
- CVPR25☆26Jul 2, 2025Updated 8 months ago
- [ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos☆117Dec 12, 2025Updated 2 months ago
- A PubMed MCP server.☆146May 7, 2025Updated 10 months ago
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?☆122Jul 24, 2025Updated 7 months ago
- ☆108Jan 9, 2022Updated 4 years ago
- This is the official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"☆273Oct 15, 2025Updated 4 months ago
- [ICLR'25] Streaming Video Question-Answering with In-context Video KV-Cache Retrieval☆104Nov 4, 2025Updated 4 months ago
- ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer☆41Jan 29, 2026Updated last month
- [AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models☆38Jan 27, 2026Updated last month
- ☆95Oct 31, 2025Updated 4 months ago
- ☆29Aug 6, 2025Updated 7 months ago
- Control your Mac with natural language by converting intent into executable action sequences, with planning, retries, and verifiable outc…☆34Feb 8, 2026Updated last month
- Official implementation of paper VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interact…☆42Feb 5, 2025Updated last year
- For dynamic target tracking in flight videos, applicable to various types of unmanned aerial vehicle systems☆84Dec 4, 2025Updated 3 months ago
- [NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models☆71Oct 10, 2025Updated 4 months ago
- ☆16Jan 23, 2026Updated last month
- Self-similarity Prior Distillation for Unsupervised Remote Physiological Measurement☆10Oct 18, 2024Updated last year
- An admin management system built with qiankun☆24Feb 6, 2026Updated last month
- ☆13Jul 3, 2024Updated last year
- Full life cycle cross providers serverless application management for your fast-growing business.☆87Updated this week
- Evaluation metrics and submission file creation scripts for the Action Recognition challenge☆15Feb 9, 2026Updated 3 weeks ago
- ☆13May 15, 2025Updated 9 months ago
- This is the code corresponding to the paper "Resolve Domain Conflicts for Generalizable Remote Physiological Measurement." accepted in AC…☆15Apr 15, 2024Updated last year
- 📚 LaTeX templates and tools for creating beautiful, structured documents 📝☆14Oct 24, 2025Updated 4 months ago
- Repo for "Synergy of Sight and Semantics: Visual Intention Understanding with CLIP"☆12Mar 12, 2025Updated 11 months ago
- ☆18Aug 7, 2025Updated 7 months ago
- Claude Code skill for improving website AEO (AI Engine Optimization) and GEO (Generative Engine Optimization) scores — 16 foundational ch…☆62Updated this week
- ☆20Nov 21, 2025Updated 3 months ago
- The official repository for "SurgNet: Self-supervised Pretraining with Semantic Consistency for Vessel and Instrument Segmentation in Sur…☆14Dec 30, 2024Updated last year
- The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).☆15Sep 25, 2025Updated 5 months ago
- ☆14Sep 11, 2025Updated 5 months ago
- Collection of papers about video-audio understanding☆22Dec 26, 2025Updated 2 months ago