Vchitect / VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,091 · Updated this week
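For reference, VBench is distributed as a pip package. Below is a minimal sketch of running an evaluation with it, based on the usage shown in the repo's README; the paths, device handling, and dimension names are illustrative placeholders rather than a definitive recipe.

```python
# Minimal sketch of a VBench evaluation run (usage pattern from the VBench README;
# file paths and the dimension list below are placeholders).
import torch
from vbench import VBench

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

my_vbench = VBench(
    device,
    "VBench_full_info.json",   # dimension/prompt metadata shipped with the repo
    "evaluation_results/",     # output directory for per-dimension scores
)

# Score a folder of generated videos on a subset of VBench dimensions.
my_vbench.evaluate(
    videos_path="sampled_videos/",
    name="my_model",
    dimension_list=["subject_consistency", "motion_smoothness"],
)
```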
Alternatives and similar repositories for VBench
Users interested in VBench are comparing it to the libraries listed below.
- [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ☆1,363 · Updated 2 weeks ago
- A reading list of video generation ☆600 · Updated this week
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers ☆608 · Updated 8 months ago
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions" ☆455 · Updated 10 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA ☆1,588 · Updated 9 months ago
- Stable Video Diffusion Training Code and Extensions. ☆703 · Updated 11 months ago
- A collection of awesome video generation studies. ☆572 · Updated 3 weeks ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆544 · Updated last month
- Let's finetune video generation models! ☆486 · Updated 2 months ago
- Scalable and memory-optimized training of diffusion models ☆1,207 · Updated last month
- ☆360 · Updated 8 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini… ☆605 · Updated 3 months ago
- Multimodal Models in Real World ☆520 · Updated 4 months ago
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024) ☆462 · Updated 8 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆911 · Updated 2 weeks ago
- ☆433 · Updated this week
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation. ☆1,848 · Updated 3 months ago
- VideoSys: An easy and efficient system for video generation ☆1,986 · Updated 4 months ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models ☆934 · Updated 8 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis