Vchitect / VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆695Updated this week
Alternatives and similar repositories for VBench:
Users that are interested in VBench are comparing it to the libraries listed below
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆557Updated 2 months ago
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆395Updated 4 months ago
- ☆354Updated 2 months ago
- A reading list of video generation☆473Updated this week
- Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆871Updated this week
- Stable Video Diffusion Training Code and Extensions.☆654Updated 5 months ago
- A collection of awesome video generation studies.☆425Updated this week
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆391Updated 6 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆810Updated 2 weeks ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆534Updated 5 months ago
- [CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"☆535Updated last week
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆550Updated 3 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆432Updated 7 months ago
- Let's finetune video generation models!☆357Updated this week
- NOVA: Autoregressive Video Generation without Vector Quantization☆314Updated this week
- 🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).☆406Updated 3 weeks ago
- Multimodal Models in Real World☆427Updated 2 months ago
- (NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis☆621Updated 3 months ago
- This repo contains the code for 1D tokenizer and generator☆645Updated this week
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆488Updated 4 months ago
- You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.☆274Updated last week
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆393Updated 2 months ago
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con…☆452Updated 2 months ago
- Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model☆401Updated 7 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆280Updated 2 months ago
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation☆212Updated 2 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆150Updated 3 months ago
- ☆457Updated 4 months ago
- Implementation of MagViT2 Tokenizer in Pytorch☆588Updated this week
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆476Updated 7 months ago