PKU-YuanGroup/Video-Bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PKU-YuanGroup/Video-Bench)

PKU-YuanGroup / Video-Bench

A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!

☆140

Alternatives and similar repositories for Video-Bench

Users that are interested in Video-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PKU-YuanGroup / GPT-as-Language-Tree
View on GitHub
GPT as a Monte Carlo Language Tree: A Probabilistic Perspective
☆46Jan 18, 2025Updated last year
HowardLi1984 / ECDFormer
View on GitHub
【Nature Computational Science 2025🔥】Deep peak property learning for efficient chiral molecules ECD spectra prediction
☆51Jan 12, 2025Updated last year
PKU-YuanGroup / LanguageBind
View on GitHub
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
☆884Mar 25, 2024Updated 2 years ago
munanning / MADAv2
View on GitHub
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation
☆25Jul 8, 2023Updated 3 years ago
Tencent-Hunyuan / GEAR
View on GitHub
☆65Jul 1, 2026Updated 3 weeks ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
IDEA-XL / ChemCoTBench
View on GitHub
LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry
☆55Oct 9, 2025Updated 9 months ago
PKU-YuanGroup / N-LoRA
View on GitHub
【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".
☆38Dec 5, 2024Updated last year
PKU-YuanGroup / AsFT
View on GitHub
Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".
☆37Jul 10, 2025Updated last year
llyx97 / TempCompass
View on GitHub
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, …
☆133Apr 4, 2025Updated last year
KlingAIResearch / Uniaa
View on GitHub
Unified Multi-modal IAA Baseline and Benchmark
☆94Sep 27, 2024Updated last year
lscpku / VITATECS
View on GitHub
☆18Jul 10, 2024Updated 2 years ago
cxh0519 / Progressive3D
View on GitHub
Official implementation of "Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts" [IC…
☆123Jun 27, 2024Updated 2 years ago
PKU-YuanGroup / EvaGaussians
View on GitHub
☆60Mar 16, 2025Updated last year
PKU-YuanGroup / Next-Patch-Prediction
View on GitHub
[AAAI26] Next Patch Prediction
☆129Jan 2, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
PKU-YuanGroup / Video-LLaVA
View on GitHub
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
☆3,494Dec 3, 2024Updated last year
PKU-YuanGroup / WF-VAE
View on GitHub
[CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
☆205May 11, 2025Updated last year
Share14 / ShareGemini
View on GitHub
☆32Jul 29, 2024Updated last year
PKU-YuanGroup / Chat-UniVi
View on GitHub
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
☆943Oct 16, 2024Updated last year
PKU-YuanGroup / TaxDiff
View on GitHub
The official code for "TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation"
☆75Aug 23, 2024Updated last year
AILab-CVC / SEED-Bench
View on GitHub
(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
☆366Jan 14, 2025Updated last year
DCDmllm / Momentor
View on GitHub
☆81Nov 24, 2024Updated last year
google-deepmind / perception_test
View on GitHub
☆254Jun 19, 2026Updated last month
qiujihao19 / Artemis
View on GitHub
[NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos
☆27Apr 8, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
RupertLuo / Valley
View on GitHub
The official repository of "Video assistant towards large language model makes everything easy"
☆232Dec 24, 2024Updated last year
RUCAIBox / Event-Bench
View on GitHub
Official code of *Towards Event-oriented Long Video Understanding*
☆12Jul 26, 2024Updated 2 years ago
RenShuhuai-Andy / TimeChat
View on GitHub
[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
☆425May 8, 2025Updated last year
PKU-YuanGroup / HoloTime
View on GitHub
[ACM MM 2025] HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation
☆159Sep 4, 2025Updated 10 months ago
mbzuai-oryx / Video-ChatGPT
View on GitHub
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the cap…
☆1,505Aug 5, 2025Updated 11 months ago
PKU-YuanGroup / Envision3D
View on GitHub
Envision3D: One Image to 3D with Anchor Views Interpolation
☆116May 16, 2024Updated 2 years ago
PKU-YuanGroup / Hallucination-Attack
View on GitHub
Attack to induce LLMs within hallucinations
☆163May 17, 2024Updated 2 years ago
ilkerkesen / ViLMA
View on GitHub
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)
☆16Jan 18, 2024Updated 2 years ago
Lyu6PosHao / HME
View on GitHub
Here is the official code for Nature Communications "Navigating Chemical-Linguistic Sharing Space with Heterogeneous Molecular Encoding".
☆23May 23, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
MME-Benchmarks / Video-MME
View on GitHub
✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
☆788Dec 8, 2025Updated 7 months ago
PKU-YuanGroup / WISE
View on GitHub
[ICML 2026🔥] WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
☆212Jun 26, 2026Updated last month
jiawangbai / HAT
View on GitHub
Implementation of HAT https://arxiv.org/pdf/2204.00993
☆51Mar 23, 2024Updated 2 years ago
mbzuai-oryx / Video-LLaVA
View on GitHub
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
☆264Aug 5, 2025Updated 11 months ago
hshjerry / VideoEspresso
View on GitHub
[CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
☆140Jul 28, 2025Updated 11 months ago
PKU-YuanGroup / Machine-Mindset
View on GitHub
An MBTI Exploration of Large Language Models
☆538Feb 2, 2024Updated 2 years ago
PKU-YuanGroup / repaint123
View on GitHub
Official implementation of Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting (ECCV…
☆276Apr 23, 2026Updated 3 months ago