π₯π₯MLVU: Multi-task Long Video Understanding Benchmark
β255Apr 13, 2026Updated last month
Alternatives and similar repositories for MLVU
Users that are interested in MLVU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMsβ56Mar 9, 2025Updated last year
- π₯π₯First-ever hour scale video understanding modelsβ622Jul 14, 2025Updated 10 months ago
- Long Context Transfer from Language to Visionβ403Mar 18, 2025Updated last year
- [Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.β125Jul 27, 2024Updated last year
- β¨β¨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysisβ768Dec 8, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmarkβ144Jul 9, 2025Updated 10 months ago
- β32Jul 29, 2024Updated last year
- Official implementation of paper AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understandingβ89Apr 21, 2026Updated last month
- β111Dec 30, 2024Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Showsβ20Nov 4, 2025Updated 6 months ago
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modelingβ524Nov 18, 2025Updated 6 months ago
- [CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understandingβ698Jan 29, 2025Updated last year
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understandingβ40Mar 16, 2025Updated last year
- β37Sep 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, β¦β131Apr 4, 2025Updated last year
- Awesome papers & datasets specifically focused on long-term videos.β374Oct 9, 2025Updated 7 months ago
- This is the official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"β279Oct 15, 2025Updated 7 months ago
- VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMsβ1,297Jan 23, 2025Updated last year
- β81Nov 24, 2024Updated last year
- Official repository for the paper PLLaVAβ672Jul 28, 2024Updated last year
- β157Oct 31, 2024Updated last year
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.β74Oct 14, 2024Updated last year
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clouβ¦β3,797Mar 12, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official code of *Towards Event-oriented Long Video Understanding*β12Jul 26, 2024Updated last year
- π₯π₯[NeurIPS2025]Exploring and mitigating semantic hallucinations in scene text perception and reasoningβ29Dec 11, 2025Updated 5 months ago
- β148Nov 17, 2025Updated 6 months ago
- β¨First Open-Source R1-like Video-LLM [2025/02/18]β383Feb 23, 2025Updated last year
- [CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.β3,339Jan 18, 2025Updated last year
- Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understandingβ291Aug 5, 2025Updated 9 months ago
- One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasksβ4,150Updated this week
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Abilityβ106Nov 28, 2024Updated last year
- β4,667Apr 15, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β160Jan 16, 2025Updated last year
- β56Mar 19, 2025Updated last year
- β37Nov 8, 2024Updated last year
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmarkβ165Mar 29, 2026Updated last month
- π₯π₯π₯ [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.β3,176Mar 28, 2026Updated last month
- [ICML 2025] Official PyTorch implementation of LongVUβ425May 8, 2025Updated last year
- β248Jun 4, 2025Updated 11 months ago