Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
☆435 · Mar 8, 2025 · Updated 11 months ago
Alternatives and similar repositories for MovieGenBench
Users who are interested in MovieGenBench are comparing it to the repositories listed below
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation ☆1,496 · Feb 23, 2026 · Updated last week
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers ☆674 · Oct 25, 2024 · Updated last year
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions" ☆506 · Sep 2, 2024 · Updated last year
- Code repository for T2V-Turbo and T2V-Turbo-v2 ☆314 · Jan 31, 2025 · Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆996 · Nov 25, 2025 · Updated 3 months ago
- A suite of image and video neural tokenizers ☆1,711 · Feb 11, 2025 · Updated last year
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation ☆399 · May 30, 2025 · Updated 9 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling ☆3,162 · Dec 21, 2024 · Updated last year
- The best OSS video generation models, created by Genmo ☆3,611 · Nov 14, 2025 · Updated 3 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback ☆428 · Sep 24, 2025 · Updated 5 months ago
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training ☆190 · Jan 27, 2025 · Updated last year
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te… ☆1,132 · Feb 7, 2025 · Updated last year
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think ☆1,560 · Mar 16, 2025 · Updated 11 months ago
- Next-Token Prediction is All You Need ☆2,355 · Jan 12, 2026 · Updated last month
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,936 · Aug 15, 2024 · Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation. ☆1,887 · Jan 8, 2026 · Updated last month
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation ☆566 · Sep 16, 2024 · Updated last year
- VideoSys: An easy and efficient system for video generation ☆2,016 · Aug 27, 2025 · Updated 6 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆12 · Mar 11, 2025 · Updated 11 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance ☆142 · Sep 28, 2024 · Updated last year
- Scalable and memory-optimized training of diffusion models ☆1,341 · Jun 4, 2025 · Updated 9 months ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation. ☆1,920 · Oct 30, 2025 · Updated 4 months ago
- Keyframe Interpolation with CogvideoX ☆139 · Oct 31, 2024 · Updated last year
- VideoGen-Eval: Agent-based System for Video Generation Evaluation ☆257 · Dec 16, 2025 · Updated 2 months ago
- Stable Video Diffusion Training Code and Extensions. ☆732 · Jul 25, 2024 · Updated last year
- Scaling Diffusion Transformers with Mixture of Experts ☆417 · Sep 9, 2024 · Updated last year
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini… ☆638 · Oct 16, 2025 · Updated 4 months ago
- A curated list of recent diffusion models for video generation, editing, and various other applications. ☆5,489 · Updated this week
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆629 · Oct 29, 2025 · Updated 4 months ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,796 · May 20, 2025 · Updated 9 months ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model ☆11,780 · Nov 21, 2025 · Updated 3 months ago
- EDM2 and Autoguidance -- Official PyTorch implementation ☆822 · Dec 9, 2024 · Updated last year
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838 ☆1,863 · Feb 20, 2026 · Updated last week
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models ☆948 · Nov 13, 2024 · Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,252 · Feb 16, 2025 · Updated last year
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models ☆1,402 · Dec 16, 2025 · Updated 2 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models ☆286 · Dec 4, 2024 · Updated last year
- ☆3,174 · Mar 17, 2025 · Updated 11 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024 ☆130 · Oct 18, 2024 · Updated last year