[ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark
β144Jul 9, 2025Updated 11 months ago
Alternatives and similar repositories for LVBench
Users that are interested in LVBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.β130Jul 27, 2024Updated last year
- π₯π₯MLVU: Multi-task Long Video Understanding Benchmarkβ261Apr 13, 2026Updated last month
- β32Jul 29, 2024Updated last year
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMsβ56Mar 9, 2025Updated last year
- β11Aug 4, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- β116Dec 30, 2024Updated last year
- Long Context Transfer from Language to Visionβ405Mar 18, 2025Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Showsβ20Nov 4, 2025Updated 7 months ago
- β¨β¨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysisβ779Dec 8, 2025Updated 6 months ago
- [CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understandingβ699Jan 29, 2025Updated last year
- [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, β¦β131Apr 4, 2025Updated last year
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modelingβ527Nov 18, 2025Updated 6 months ago
- Official code of *Towards Event-oriented Long Video Understanding*β12Jul 26, 2024Updated last year
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]β21Feb 27, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- π₯π₯First-ever hour scale video understanding modelsβ624Jul 14, 2025Updated 10 months ago
- A Massive Multi-Discipline Lecture Understanding Benchmarkβ34Apr 20, 2026Updated last month
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.β74Oct 14, 2024Updated last year
- MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)β327Jan 20, 2025Updated last year
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmarkβ144Jun 4, 2025Updated last year
- β157Oct 31, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of LongVUβ427May 8, 2025Updated last year
- β13Oct 19, 2023Updated 2 years ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Modelβ17Feb 13, 2025Updated last year
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- πΎ E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)β74Jan 20, 2025Updated last year
- official impelmentation of Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Inputβ68Aug 30, 2024Updated last year
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understandingβ40Mar 16, 2025Updated last year
- β17Feb 22, 2024Updated 2 years ago
- [ACL 2024 π₯] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capβ¦β1,502Aug 5, 2025Updated 10 months ago
- A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!β139Dec 31, 2023Updated 2 years ago
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understandingβ218Dec 19, 2025Updated 5 months ago
- β108Jul 30, 2024Updated last year
- β222Jul 5, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ECCV 2024π₯] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"β155Sep 10, 2024Updated last year
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''β117Aug 21, 2025Updated 9 months ago
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videosβ23May 7, 2026Updated last month
- VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMsβ1,299Jan 23, 2025Updated last year
- β39Nov 8, 2024Updated last year
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)β43Dec 16, 2025Updated 5 months ago
- [ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Modelsβ155Dec 5, 2024Updated last year