๐ A curated collection of papers and open-source code repositories dedicated to the application of Vision-Language Models (VLMs) for streaming video.
โ93Apr 5, 2026Updated this week
Alternatives and similar repositories for Awesome-VLM-Streaming-Video
Users that are interested in Awesome-VLM-Streaming-Video are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] FOCUS: Efficient Keyframe Selection for Long Video Understandingโ59Feb 3, 2026Updated 2 months ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)โ21Aug 1, 2025Updated 8 months ago
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"โ43Oct 9, 2025Updated 5 months ago
- โ10Feb 15, 2025Updated last year
- [๐๐๐ญ๐ฎ๐ซ๐ ๐๐จ๐ฆ๐ฉ๐ฎ๐ญ๐๐ญ๐ข๐จ๐ง๐๐ฅ ๐๐๐ข๐๐ง๐๐] โก๏ธ PSE/PSRN: Fast and efficient symbolic expression discovery through parallelizโฆโ21Feb 3, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient โข AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [๐๐๐ญ๐ฎ๐ซ๐ ๐๐จ๐ฆ๐ฆ๐ฎ๐ง๐ข๐๐๐ญ๐ข๐จ๐ง๐ฌ] ๐ค๐ก LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Cโฆโ23Mar 8, 2026Updated last month
- ๐ SwarmBench: Benchmarking LLMs' Swarm Intelligenceโ30May 21, 2025Updated 10 months ago
- PromptRose ๐น is your AI prompt companion, blooming at your fingertips.โ22Sep 1, 2025Updated 7 months ago
- Visual Speech Recongnitionโ20Dec 24, 2024Updated last year
- โ12Apr 29, 2024Updated last year
- โ20Feb 28, 2026Updated last month
- โ12Sep 2, 2023Updated 2 years ago
- PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Modelsโ35Jan 14, 2026Updated 2 months ago
- [CVPR2026] VideoITG: Multimodal Video Understanding with Instructed Temporal Groundingโ110Mar 26, 2026Updated last week
- Proton VPN Special Offer - Get 70% off โข AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- LineArt, a framework that transfers complex appearance onto detailed design drawings, facilitating design and artistic creation.โ14Oct 2, 2025Updated 6 months ago
- HUST็ตๅทฅๅบๅฐ-ๅบไบCubeMX็STM32ๅผๅ ่ง้ข็ไธป่ฆไปๅบๆฏ่ฟไธชใgiteeไผๅๆญฅไธไผ ใโ22May 9, 2024Updated last year
- [CVPR2022] Official Implementation of the paper 'Learning Where to Learn in Cross-View Self-Supervised Learning'โ29Oct 12, 2022Updated 3 years ago
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videosโ34May 27, 2025Updated 10 months ago
- ๅฝ็งๅคง้ๆ ๆนๆ กๅบ2024~2025ๅนด่ฏพ็จ่ตๆ๏ผๅ ๆฌๅผบๅๅญฆไน ใๆบ่ฝ่ฎก็ฎ็ณป็ปใๆจกๅผ่ฏๅซใ็ฉ้ตๅๆไธๅบ็จใไบบๅทฅๆบ่ฝๅ็ไธ็ฎๆณใ่ช็ถ่ฏญ่จๅค็โ41Sep 22, 2025Updated 6 months ago
- [CVPR2025] "AniMo: Species-Aware Model for Text-Driven Animal Motion Generation"โ45Oct 8, 2025Updated 6 months ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"โ25Feb 2, 2025Updated last year
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.โ25Oct 19, 2025Updated 5 months ago
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semanticsโ38Sep 10, 2025Updated 6 months ago
- NordVPN Special Discount Offer โข AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- โ11Oct 4, 2023Updated 2 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"โ10Dec 30, 2024Updated last year
- Code for "CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects", NeurIPS 2025โ88Mar 25, 2026Updated 2 weeks ago
- [NeurIPS 2025] Deep Memory Backtracking for Long Video Understandingโ67Feb 10, 2026Updated last month
- Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmarkโ15Jan 13, 2026Updated 2 months ago
- TermHub is a terminal-style homepage template.โ80Mar 17, 2026Updated 3 weeks ago
- [ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learningโ25Jan 14, 2026Updated 2 months ago
- โ19Jun 10, 2025Updated 9 months ago
- Gifts for landscape photographers. Help the photographer seeking for meteors in the photo sequence.โ13Jun 21, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off โข AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streamsโ76Mar 15, 2026Updated 3 weeks ago
- โ13Apr 23, 2025Updated 11 months ago
- โ22Feb 13, 2026Updated last month
- Official repository for โReasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Spaceโโ18Jan 27, 2026Updated 2 months ago
- โ67Sep 3, 2025Updated 7 months ago
- Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detectionโ31Mar 19, 2025Updated last year
- โ35May 29, 2025Updated 10 months ago