Vchitect/ShotBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Vchitect/ShotBench)

Vchitect / ShotBench

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

☆102

Alternatives and similar repositories for ShotBench

Users that are interested in ShotBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PRIS-CV / CineTechBench
View on GitHub
A Benchmark for Cinematographic Technique Understanding and Generation
☆29Sep 19, 2025Updated 10 months ago
Vchitect / Cut2Next
View on GitHub
Cut2Next: Generating Next Shot via In-Context Tuning
☆33Aug 21, 2025Updated 11 months ago
Vchitect / Uni-MMMU
View on GitHub
[ACL2026 oral] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark
☆25Apr 13, 2026Updated 3 months ago
3DTopia / GenDoP
View on GitHub
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
☆126Dec 31, 2025Updated 6 months ago
penghao-wu / ProxyV
View on GitHub
[ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM
☆20May 22, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
showlab / MovieAgent
View on GitHub
MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning
☆349Mar 26, 2025Updated last year
Vchitect / RealDPO
View on GitHub
☆32Dec 17, 2025Updated 7 months ago
TencentARC / Video-Holmes
View on GitHub
[ECCV 2026] Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
☆95Jul 13, 2025Updated last year
gen-ai-team / kandinsky-video-tools
View on GitHub
A set of NN models for evaluating object and camera motion in videos
☆16Jul 24, 2025Updated last year
mutonix / Vript
View on GitHub
☆161Jan 16, 2025Updated last year
Adam-duan / DiffRetouch
View on GitHub
[AAAI2025] This is the official PyTorch codes for the paper: "DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts"
☆25Jun 16, 2025Updated last year
SPIresearch / EviPrompt
View on GitHub
☆15Nov 13, 2024Updated last year
SaraGhazanfari / CoF
View on GitHub
Chain-of-Frames [CVPR 2026]
☆40Jul 2, 2025Updated last year
Jyxarthur / shot-by-shot
View on GitHub
[ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…
☆24May 16, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Vchitect / Evaluation-Agent
View on GitHub
[ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible
☆128Aug 10, 2025Updated 11 months ago
byhuang123 / PoCo
View on GitHub
[CVPR2026] Official implementation of our paper “Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot…
☆19Apr 8, 2026Updated 3 months ago
ljzycmd / SCD
View on GitHub
Consistent Human Image and Video Generation with Spatially Conditioned Diffusion
☆16Sep 1, 2025Updated 10 months ago
lambert-x / VideoAuteur
View on GitHub
VideoAuteur: Towards Long Narrative Video Generation
☆44Oct 22, 2025Updated 9 months ago
agwmon / frame-guidance
View on GitHub
[ICLR 2026] Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
☆64Mar 3, 2026Updated 4 months ago
wuxiaofei01 / PFVG
View on GitHub
☆20Dec 24, 2025Updated 7 months ago
Vchitect / LiteGen
View on GitHub
A light-weight and high-efficient training framework for accelerating diffusion tasks.
☆53Apr 23, 2026Updated 3 months ago
KlingAIResearch / CamCloneMaster
View on GitHub
[SIGGRAPH Asia'25] Enabling Reference-based Camera Control via Context without Explicit 3D Estimation
☆159Jan 18, 2026Updated 6 months ago
PardoAlejo / MovieCuts
View on GitHub
Learning to cut end-to-end pretrained modules
☆38Apr 17, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mfarre / Video-LLaVA-7B-hf-CinePile
View on GitHub
Video-LlaVA fine-tune for CinePile evaluation
☆51Aug 8, 2024Updated last year
mira-space / MiraData
View on GitHub
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
☆528Sep 2, 2024Updated last year
snap-research / VIMI
View on GitHub
☆13Jul 10, 2024Updated 2 years ago
CV-xueba / PICD_ImageComposition
View on GitHub
☆56Sep 4, 2025Updated 10 months ago
NVlabs / Long-RL
View on GitHub
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
☆727Sep 24, 2025Updated 10 months ago
yosefdayani / MV-RAG
View on GitHub
MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…
☆23Nov 29, 2025Updated 7 months ago
DYEvaLab / EvalMuse-Structure
View on GitHub
☆18Feb 12, 2025Updated last year
Sense-GVT / BigPretrain
View on GitHub
A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)
☆15Oct 18, 2021Updated 4 years ago
baaivision / Emu3.5
View on GitHub
Native Multimodal Models are World Learners
☆1,538Dec 30, 2025Updated 6 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
VidCapBench / VidCapBench
View on GitHub
☆13May 17, 2025Updated last year
VirtualFilmStudio / Cinetransfer
View on GitHub
☆35Jun 18, 2024Updated 2 years ago
CUC-MIPG / UniVid
View on GitHub
Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026
☆37Nov 24, 2025Updated 8 months ago
Vchitect / LongVie
View on GitHub
☆334Jan 24, 2026Updated 6 months ago
yukangcao / AvatarGO
View on GitHub
[ICLR' 25] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
☆69Mar 19, 2025Updated last year
JiuhaiChen / BLIP3o
View on GitHub
Official implementation of BLIP3o-Series
☆1,664Nov 29, 2025Updated 7 months ago
Vchitect / VBench
View on GitHub
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,706Mar 23, 2026Updated 4 months ago