[ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark
β144Jul 9, 2025Updated 10 months ago
Alternatives and similar repositories for LVBench
Users that are interested in LVBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.β125Jul 27, 2024Updated last year
- π₯π₯MLVU: Multi-task Long Video Understanding Benchmarkβ255Apr 13, 2026Updated last month
- β32Jul 29, 2024Updated last year
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMsβ57Mar 9, 2025Updated last year
- β11Aug 4, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β111Dec 30, 2024Updated last year
- Long Context Transfer from Language to Visionβ403Mar 18, 2025Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Showsβ20Nov 4, 2025Updated 6 months ago
- β¨β¨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysisβ768Dec 8, 2025Updated 5 months ago
- [CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understandingβ698Jan 29, 2025Updated last year
- [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, β¦β131Apr 4, 2025Updated last year
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modelingβ523Nov 18, 2025Updated 6 months ago
- Official code of *Towards Event-oriented Long Video Understanding*β12Jul 26, 2024Updated last year
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]β21Feb 27, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- π₯π₯First-ever hour scale video understanding modelsβ622Jul 14, 2025Updated 10 months ago
- A Massive Multi-Discipline Lecture Understanding Benchmarkβ34Apr 20, 2026Updated last month
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.β74Oct 14, 2024Updated last year
- MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)β326Jan 20, 2025Updated last year
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmarkβ142Jun 4, 2025Updated 11 months ago
- β157Oct 31, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of LongVUβ425May 8, 2025Updated last year
- β13Oct 19, 2023Updated 2 years ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Modelβ17Feb 13, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- πΎ E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)β74Jan 20, 2025Updated last year
- official impelmentation of Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Inputβ68Aug 30, 2024Updated last year
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understandingβ40Mar 16, 2025Updated last year
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understandingβ213Dec 19, 2025Updated 5 months ago
- β17Feb 22, 2024Updated 2 years ago
- [ACL 2024 π₯] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capβ¦β1,500Aug 5, 2025Updated 9 months ago
- A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!β138Dec 31, 2023Updated 2 years ago
- β108Jul 30, 2024Updated last year
- β222Jul 5, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ECCV 2024π₯] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"β155Sep 10, 2024Updated last year
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''β116Aug 21, 2025Updated 9 months ago
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videosβ23May 7, 2026Updated 2 weeks ago
- VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMsβ1,297Jan 23, 2025Updated last year
- β37Nov 8, 2024Updated last year
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)β43Dec 16, 2025Updated 5 months ago
- [ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Modelsβ155Dec 5, 2024Updated last year