๐ A curated collection of papers and open-source code repositories dedicated to the application of Vision-Language Models (VLMs) for streaming video.
โ167May 31, 2026Updated last week
Alternatives and similar repositories for Awesome-VLM-Streaming-Video
Users that are interested in Awesome-VLM-Streaming-Video are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] FOCUS: Efficient Keyframe Selection for Long Video Understandingโ70Apr 23, 2026Updated last month
- [CVPR 2026 Highlight] VideoITG: Multimodal Video Understanding with Instructed Temporal Groundingโ120Apr 17, 2026Updated last month
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"โ48Oct 9, 2025Updated 7 months ago
- [๐๐๐ญ๐ฎ๐ซ๐ ๐๐จ๐ฆ๐ฉ๐ฎ๐ญ๐๐ญ๐ข๐จ๐ง๐๐ฅ ๐๐๐ข๐๐ง๐๐] โก๏ธ PSE/PSRN: Fast and efficient symbolic expression discovery through parallelizโฆโ22May 17, 2026Updated 3 weeks ago
- [Roadmap] Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modelingโ112May 13, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [๐๐๐ญ๐ฎ๐ซ๐ ๐๐จ๐ฆ๐ฆ๐ฎ๐ง๐ข๐๐๐ญ๐ข๐จ๐ง๐ฌ] ๐ค๐ก LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Cโฆโ27Apr 21, 2026Updated last month
- Awesome papers for affective computing with llm and mllmโ27Nov 26, 2025Updated 6 months ago
- โ12Apr 29, 2024Updated 2 years ago
- Visual Speech Recongnitionโ20Dec 24, 2024Updated last year
- โ24Jun 1, 2026Updated last week
- LineArt, a framework that transfers complex appearance onto detailed design drawings, facilitating design and artistic creation.โ15Oct 2, 2025Updated 8 months ago
- ๅฝ็งๅคง้ๆ ๆนๆ กๅบ2024~2025ๅนด่ฏพ็จ่ตๆ๏ผๅ ๆฌๅผบๅๅญฆไน ใๆบ่ฝ่ฎก็ฎ็ณป็ปใๆจกๅผ่ฏๅซใ็ฉ้ตๅๆไธๅบ็จใไบบๅทฅๆบ่ฝๅ็ไธ็ฎๆณใ่ช็ถ่ฏญ่จๅค็โ42Sep 22, 2025Updated 8 months ago
- โ18May 18, 2026Updated 3 weeks ago
- [CVPR2022] Official Implementation of the paper 'Learning Where to Learn in Cross-View Self-Supervised Learning'โ29Oct 12, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean โข AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- โ16Jan 6, 2025Updated last year
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"โ25Feb 2, 2025Updated last year
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videosโ37May 27, 2025Updated last year
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.โ29Oct 19, 2025Updated 7 months ago
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semanticsโ37Sep 10, 2025Updated 8 months ago
- Codes for "UniMF: A Unified Multimodal Framework for Multimodal Sentiment Analysis in Missing Modalities and Unaligned Multimodal Sequencโฆโ30Jan 9, 2024Updated 2 years ago
- โ11Oct 4, 2023Updated 2 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"โ11Dec 30, 2024Updated last year
- Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmarkโ16Jan 13, 2026Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- โ27Apr 25, 2022Updated 4 years ago
- [NeurIPS 2025] Deep Memory Backtracking for Long Video Understandingโ68Feb 10, 2026Updated 3 months ago
- โ55Apr 7, 2026Updated 2 months ago
- Code for "CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects", NeurIPS 2025โ90Mar 25, 2026Updated 2 months ago
- [CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Mapsโ13Mar 26, 2025Updated last year
- code for downloading videos from HowTo100M datasetโ18May 13, 2021Updated 5 years ago
- โ20Jun 10, 2025Updated 11 months ago
- โ22Mar 17, 2026Updated 2 months ago
- Gifts for landscape photographers. Help the photographer seeking for meteors in the photo sequence.โ13Jun 21, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean โข AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- โ13Apr 23, 2025Updated last year
- โ40Feb 23, 2025Updated last year
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmarkโ25Apr 13, 2026Updated last month
- โ15Feb 24, 2022Updated 4 years ago
- Official repository for โReasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Spaceโโ18Jan 27, 2026Updated 4 months ago
- โ36May 29, 2025Updated last year
- Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detectionโ31Mar 19, 2025Updated last year