[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
☆286Dec 4, 2024Updated last year
Alternatives and similar repositories for CV-VAE
Users that are interested in CV-VAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆408May 30, 2025Updated 9 months ago
- This repo contains the code for 1D tokenizer and generator☆1,129Mar 20, 2025Updated last year
- [NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.☆323Jul 9, 2024Updated last year
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆636Oct 29, 2025Updated 4 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆676Oct 25, 2024Updated last year
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆398Jan 19, 2025Updated last year
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,929Oct 30, 2025Updated 4 months ago
- ☆66Jun 4, 2024Updated last year
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆569Sep 16, 2024Updated last year
- [ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.☆509Mar 7, 2024Updated 2 years ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,941Aug 15, 2024Updated last year
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)☆482Oct 18, 2024Updated last year
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆428Aug 25, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- VideoSys: An easy and efficient system for video generation☆2,020Aug 27, 2025Updated 6 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆998Nov 25, 2025Updated 4 months ago
- ☆646May 24, 2024Updated last year
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models☆545Jan 18, 2024Updated 2 years ago
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆534Sep 8, 2025Updated 6 months ago
- NeurIPS 2024☆395Sep 26, 2024Updated last year
- [CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models☆296May 17, 2025Updated 10 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,284Oct 31, 2024Updated last year
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆202Jan 7, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,252Mar 6, 2025Updated last year
- A suite of image and video neural tokenizers☆1,716Feb 11, 2025Updated last year
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Sep 19, 2025Updated 6 months ago
- ☆362Oct 21, 2024Updated last year
- ☆214Feb 11, 2025Updated last year
- Stable Video Diffusion Training Code and Extensions.☆734Jul 25, 2024Updated last year
- ☆190Dec 17, 2024Updated last year
- ☆30Mar 4, 2025Updated last year
- Code repository for T2V-Turbo and T2V-Turbo-v2☆314Jan 31, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,541Mar 16, 2026Updated last week
- Official Code for MotionCtrl [SIGGRAPH 2024]☆1,492Feb 19, 2025Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆312Mar 12, 2025Updated last year
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer☆995Jan 17, 2024Updated 2 years ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 5 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.☆767Dec 5, 2024Updated last year
- Pixel-Space Generative Models☆307May 11, 2025Updated 10 months ago