ydyhello/Awesome-VLM-Streaming-Video

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ydyhello/Awesome-VLM-Streaming-Video)

ydyhello / Awesome-VLM-Streaming-Video

📚 A curated collection of papers and open-source code repositories dedicated to the application of Vision-Language Models (VLMs) for streaming video.

☆188

Alternatives and similar repositories for Awesome-VLM-Streaming-Video

Users that are interested in Awesome-VLM-Streaming-Video are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sotayang / Awesome-Streaming-Video-Understanding
View on GitHub
🔥🔥🔥 [Awesome] Latest Papers, Codes & Datasets on Streaming / Online Video Understanding — Building Always-on, Real-time Video AI 🤖
☆409Jul 2, 2026Updated 2 weeks ago
xiangbo05 / MemoryDial_Public
View on GitHub
MemoryDial
☆15Mar 10, 2026Updated 4 months ago
zxc123cc / TrendFact
View on GitHub
ACL26 Long Paper
☆18Jul 4, 2026Updated 2 weeks ago
xiZAIzai / JailExpert
View on GitHub
This is the official repository for JailExpert
☆23Sep 9, 2025Updated 10 months ago
xvolcano02 / UCAS
View on GitHub
[ACL2026] UCAS: Uncertainty-aware Advantage Shaping for RLVR
☆31Apr 14, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Sundiasy / TopoDIM
View on GitHub
[ACL26 Findings] TopoDIM: One-shot Topology Generation of Diverse Interaction Modes for Multi-Agent Systems
☆19Jan 19, 2026Updated 6 months ago
EvolvingLMMs-Lab / SimpleStream
View on GitHub
A simple video streaming baseline that outperforms SOTAs.
☆148May 1, 2026Updated 2 months ago
maifoundations / Streamo
View on GitHub
Streaming Video Instruction Tuning
☆78Feb 25, 2026Updated 4 months ago
lijunxian111 / IAG
View on GitHub
[🏆CVPR'26] Official Repo for IAG: Input-aware Backdoor Attack on VLM-based Visual Grounding
☆32Jun 2, 2026Updated last month
William030422 / Video-Sycophancy
View on GitHub
Implementation for paper Flattery in Motion: Benchmarking and Analyzing Sycophancy in Video-LLMs, which is accepted by ACL 2026 (main con…
☆16Oct 10, 2025Updated 9 months ago
OzymandiasChen / PCGR
View on GitHub
Prototype Conditioned Generative Replay for Continual Learning in NLP - NAACL 2025
☆26Apr 9, 2026Updated 3 months ago
haowei-freesky / HERMES
View on GitHub
Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]
☆92May 8, 2026Updated 2 months ago
LJungang / Awesome-Video-Reasoning-Landscape
View on GitHub
🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.
☆188Jun 14, 2026Updated last month
jicoder-nwpu / STRIDE-ED
View on GitHub
Data and Code Repository for “STRIDE-ED: A Strategy-Grounded Stepwise Reasoning Framework for Empathetic Dialogue Systems”
☆17Apr 17, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OpenMOSS / MOSS-Video-Preview
View on GitHub
A real-time video understanding foundation model with gated cross-attention. Offline & real-time inference.
☆157Updated this week
yaolinli / TimeChat-Online
View on GitHub
[ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
☆132Jun 29, 2026Updated 3 weeks ago
ZHUWEI-hub / GUARD
View on GitHub
[ACL 2026] Dissecting Failure Dynamics in Large Language Model Reasoning
☆18Apr 17, 2026Updated 3 months ago
ZHUWEI-hub / SYMPHONY
View on GitHub
[NeurIPS 2025] SYMPHONY: Synergistic Multi-agent Planning with Heterogeneous Language Model Assembly
☆16Oct 22, 2025Updated 8 months ago
JoeLeelyf / OVO-Bench
View on GitHub
[CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
☆153Jul 24, 2025Updated 11 months ago
YIGE24 / StreamingTOM
View on GitHub
☆26Mar 5, 2026Updated 4 months ago
Becomebright / ReKV
View on GitHub
[ICLR'25] Streaming Video Question-Answering with In-context Video KV-Cache Retrieval
☆121Nov 4, 2025Updated 8 months ago
Yang011013 / Awesome-Streaming-Video-Understanding
View on GitHub
Awesome latest models, datasets and benchmarks on streaming/online video understanding.
☆31Oct 19, 2025Updated 9 months ago
lern-to-write / STC
View on GitHub
[CVPR 2026] Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
☆70Jun 8, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
SooLab / EyeWO
View on GitHub
[NeurIPS2025] The official PyTorch implementation of the "Eyes Wide Open: Ego Proactive Video-LLM for Streaming Video".
☆34Dec 25, 2025Updated 6 months ago
ShareLab-SII / FluxMem
View on GitHub
[CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding
☆73Mar 16, 2026Updated 4 months ago
cyuQ1n / EasyVideoR1
View on GitHub
☆155Apr 27, 2026Updated 2 months ago
MCG-NJU / StreamForest
View on GitHub
[NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory
☆131Nov 4, 2025Updated 8 months ago
aurateam2026 / AURA
View on GitHub
☆114Jun 5, 2026Updated last month
dingyue772 / OmniSIFT
View on GitHub
[ICML2026] OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
☆25May 21, 2026Updated last month
CASIA-IVA-Lab / ThinkStream
View on GitHub
☆40Jun 18, 2026Updated last month
meituan-longcat / General365
View on GitHub
This is the official repo for the paper "General365: Benchmarking General Reasoning in LLMs under High Difficulty and Diversity".
☆85Apr 14, 2026Updated 3 months ago
sarendis56 / Jailbreak_Detection_RCS
View on GitHub
Official Codebase of the ACL 2026 Oral paper "Rethinking Jailbreak Detection of Large Vision Language Models with Representational Contra…
☆26Jun 25, 2026Updated 3 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OzymandiasChen / ActorMind
View on GitHub
ActorMind: Emulating Human Actor Reasoning for Speech Role-Playing - ACL Findings 2026
☆25Updated this week
zapqqqwe / VideoPro_code
View on GitHub
videoPro: Adaptive Program Reasoning for Long Video Understanding
☆45Apr 15, 2026Updated 3 months ago
xinyouu / V-CAST
View on GitHub
V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video Large Language Models
☆34Apr 16, 2026Updated 3 months ago
wangruohui / EfficientVideoAgent
View on GitHub
EVA: Efficient Reinforcement Learning for End-to-End Video Agent
☆26May 6, 2026Updated 2 months ago
air-embodied-brain / Em-Garde
View on GitHub
Implementation of Em_Garde: a proposal-retrieval framework for streaming video understanding
☆26Jun 24, 2026Updated 3 weeks ago
xuyang-liu16 / VidCom2
View on GitHub
[EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models
☆127May 14, 2026Updated 2 months ago
cokeshao / Awesome-Multimodal-Token-Compression
View on GitHub
[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198
☆371May 29, 2026Updated last month