Vchitect / VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,312 Updated 3 weeks ago
Alternatives and similar repositories for VBench
Users that are interested in VBench are comparing it to the libraries listed below
- [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ☆1,493 Updated this week
- A reading list of video generation ☆632 Updated this week
- A collection of awesome video generation studies. ☆670 Updated last month
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA ☆1,620 Updated last year
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions" ☆492 Updated last year
- Scalable and memory-optimized training of diffusion models ☆1,300 Updated 5 months ago
- Let's finetune video generation models! ☆518 Updated 2 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers ☆642 Updated last year
- Stable Video Diffusion Training Code and Extensions. ☆721 Updated last year
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL ☆1,570 Updated last week
- VideoSys: An easy and efficient system for video generation ☆2,005 Updated 2 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆593 Updated 2 weeks ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation. ☆1,883 Updated 2 weeks ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,885 Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆973 Updated 3 weeks ago
- [TMLR 2025🔥] A survey for the autoregressive models in vision. ☆747 Updated last week
- You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos. ☆494 Updated 10 months ago
- ☆359 Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis ☆612 Updated last year
- Multimodal Models in Real World ☆548 Updated 8 months ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation ☆783 Updated 3 weeks ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini… ☆629 Updated 3 weeks ago
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024) ☆476 Updated last year
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models ☆1,031 Updated 3 months ago
- [ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey… ☆538 Updated 2 weeks ago
- (NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis ☆1,051 Updated 8 months ago
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem… ☆1,735 Updated 2 months ago
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images. ☆1,535 Updated this week
- Diffusion Model-Based Image Editing: A Survey (TPAMI 2025) ☆684 Updated 4 months ago
- GenEval: An object-focused framework for evaluating text-to-image alignment ☆387 Updated 8 months ago