[ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark
β144Jul 9, 2025Updated 11 months ago
Alternatives and similar repositories for LVBench
Users that are interested in LVBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.β131Jul 27, 2024Updated last year
- π₯π₯MLVU: Multi-task Long Video Understanding Benchmarkβ262Apr 13, 2026Updated 2 months ago
- β32Jul 29, 2024Updated last year
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMsβ57Mar 9, 2025Updated last year
- β11Aug 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β117Dec 30, 2024Updated last year
- Long Context Transfer from Language to Visionβ408Mar 18, 2025Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Showsβ20Nov 4, 2025Updated 7 months ago
- β¨β¨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysisβ780Dec 8, 2025Updated 6 months ago
- [CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understandingβ703Jan 29, 2025Updated last year
- [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, β¦β132Apr 4, 2025Updated last year
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modelingβ526Nov 18, 2025Updated 7 months ago
- Official code of *Towards Event-oriented Long Video Understanding*β12Jul 26, 2024Updated last year
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]β20Feb 27, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- π₯π₯First-ever hour scale video understanding modelsβ626Jul 14, 2025Updated 11 months ago
- A Massive Multi-Discipline Lecture Understanding Benchmarkβ34Apr 20, 2026Updated 2 months ago
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.β73Oct 14, 2024Updated last year
- MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)β327Jan 20, 2025Updated last year
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmarkβ145Jun 4, 2025Updated last year
- β158Oct 31, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of LongVUβ427May 8, 2025Updated last year
- β13Oct 19, 2023Updated 2 years ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Modelβ17Feb 13, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- πΎ E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)β74Jan 20, 2025Updated last year
- official impelmentation of Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Inputβ67Aug 30, 2024Updated last year
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understandingβ40Mar 16, 2025Updated last year
- β17Feb 22, 2024Updated 2 years ago
- [ACL 2024 π₯] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capβ¦β1,503Aug 5, 2025Updated 10 months ago
- A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!β140Dec 31, 2023Updated 2 years ago
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understandingβ224Dec 19, 2025Updated 6 months ago
- β108Jul 30, 2024Updated last year
- β224Jul 5, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ECCV 2024π₯] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"β154Sep 10, 2024Updated last year
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''β118Aug 21, 2025Updated 10 months ago
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videosβ24May 7, 2026Updated last month
- VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMsβ1,303Jan 23, 2025Updated last year
- β41Nov 8, 2024Updated last year
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)β43Dec 16, 2025Updated 6 months ago
- [ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Modelsβ155Dec 5, 2024Updated last year