THUDM / LVBenchLinks

[ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark

☆95

Alternatives and similar repositories for LVBench

Users that are interested in LVBench are comparing it to the libraries listed below

Sorting:

Liuziyu77 / MMDU
Official repository of MMDU dataset
☆92Updated 9 months ago
RifleZhang / LLaVA-Hound-DPO
☆152Updated 8 months ago
llyx97 / TempCompass
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, …
☆118Updated 3 months ago
longvideobench / LongVideoBench
[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
☆102Updated 11 months ago
OpenGVLab / MM-NIAH
[NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of…
☆117Updated 7 months ago
TIGER-AI-Lab / Mantis
Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024]
☆218Updated 3 months ago
OpenGVLab / MMT-Bench
ICML'2024 | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
☆113Updated 11 months ago
joez17 / VideoNIAH
VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs
☆47Updated 4 months ago
DAMO-NLP-SG / multimodal_textbook
[ICCV 2025] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"
☆164Updated 3 months ago
JUNJIE99 / MLVU
🔥🔥MLVU: Multi-task Long Video Understanding Benchmark
☆210Updated last month
KangarooGroup / Kangaroo
official impelmentation of Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
☆66Updated 10 months ago
X2FD / LVIS-INSTRUCT4V
☆133Updated last year
HJYao00 / DenseConnector
【NeurIPS 2024】Dense Connector for MLLMs
☆167Updated 8 months ago
imagegridworth / IG-VLM
☆136Updated 9 months ago
MMStar-Benchmark / MMStar
[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"
☆185Updated 9 months ago
bronyayang / Law_of_Vision_Representation_in_MLLMs
Official implementation of the Law of Vision Representation in MLLMs
☆160Updated 7 months ago
FreedomIntelligence / ALLaVA
Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model
☆266Updated last year
LengSicong / MMR1
MMR1: Advancing the Frontiers of Multimodal Reasoning
☆162Updated 3 months ago
PKU-YuanGroup / Video-Bench
A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!
☆128Updated last year
luogen1996 / LLaVA-HR
[ICLR2025] LLaVA-HR: High-Resolution Large Language-Vision Assistant
☆237Updated 10 months ago
Yangyi-Chen / SOLO
[TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"
☆143Updated 7 months ago
Kwai-YuanQi / MM-RLHF
The Next Step Forward in Multimodal LLM Alignment
☆169Updated 2 months ago
BAAI-DCAI / DataOptim
A collection of visual instruction tuning datasets.
☆76Updated last year
open-compass / MMBench
Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"
☆229Updated last month
swordlidev / Evaluation-Multimodal-LLMs-Survey
A Survey on Benchmarks of Multimodal Large Language Models
☆119Updated last week
EvolvingLMMs-Lab / LongVA
Long Context Transfer from Language to Vision
☆384Updated 3 months ago
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"
☆124Updated last month
vlf-silkie / VLFeedback
☆100Updated last year
baaivision / EVE
EVE Series: Encoder-Free Vision-Language Models from BAAI
☆333Updated this week
42Shawn / LLaVA-PruMerge
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
☆139Updated 2 weeks ago