[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
☆286 · Dec 4, 2024 · Updated last year
Alternatives and similar repositories for CV-VAE
Users who are interested in CV-VAE are comparing it to the libraries listed below.
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation · ☆399 · May 30, 2025 · Updated 9 months ago
- This repo contains the code for 1D tokenizer and generator · ☆1,117 · Mar 20, 2025 · Updated 11 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization · ☆629 · Oct 29, 2025 · Updated 4 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation · ☆2,252 · Feb 16, 2025 · Updated last year
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers · ☆674 · Oct 25, 2024 · Updated last year
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE · ☆392 · Jan 19, 2025 · Updated last year
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation · ☆566 · Sep 16, 2024 · Updated last year
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation. · ☆1,920 · Oct 30, 2025 · Updated 4 months ago
- [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization. · ☆321 · Jul 9, 2024 · Updated last year
- ☆643 · May 24, 2024 · Updated last year
- ☆66 · Jun 4, 2024 · Updated last year
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024) · ☆481 · Oct 18, 2024 · Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation · ☆1,936 · Aug 15, 2024 · Updated last year
- [CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models · ☆295 · May 17, 2025 · Updated 9 months ago
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models · ☆545 · Jan 18, 2024 · Updated 2 years ago
- [ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time. · ☆510 · Mar 7, 2024 · Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers · ☆996 · Nov 25, 2025 · Updated 3 months ago
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024) · ☆534 · Sep 8, 2025 · Updated 5 months ago
- VideoSys: An easy and efficient system for video generation · ☆2,016 · Aug 27, 2025 · Updated 6 months ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter · ☆427 · Aug 25, 2025 · Updated 6 months ago
- NeurIPS 2024 · ☆395 · Sep 26, 2024 · Updated last year
- Code repository for T2V-Turbo and T2V-Turbo-v2 · ☆314 · Jan 31, 2025 · Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis · ☆3,281 · Oct 31, 2024 · Updated last year
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" · ☆198 · Jan 7, 2026 · Updated last month
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation · ☆111 · Sep 19, 2025 · Updated 5 months ago
- Stable Video Diffusion Training Code and Extensions. · ☆732 · Jul 25, 2024 · Updated last year
- Pixel-Space Generative Models · ☆301 · May 11, 2025 · Updated 9 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025) · ☆86 · Feb 27, 2025 · Updated last year
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion · ☆2,249 · Mar 6, 2025 · Updated 11 months ago
- ☆361 · Oct 21, 2024 · Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method · ☆27 · Oct 9, 2025 · Updated 4 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation [TMLR 2024] · ☆260 · Jul 1, 2024 · Updated last year
- A suite of image and video neural tokenizers · ☆1,711 · Feb 11, 2025 · Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope… · ☆309 · Mar 12, 2025 · Updated 11 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. · ☆767 · Dec 5, 2024 · Updated last year
- [NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation" · ☆138 · Oct 8, 2024 · Updated last year
- Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators" · ☆359 · Dec 8, 2023 · Updated 2 years ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model · ☆198 · May 11, 2025 · Updated 9 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM! · ☆512 · Dec 11, 2024 · Updated last year