AILab-CVC / CV-VAE
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
☆262Updated 2 months ago
Alternatives and similar repositories for CV-VAE:
Users that are interested in CV-VAE are comparing it to the libraries listed below
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆239Updated this week
- VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆278Updated 3 weeks ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆135Updated 7 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆382Updated last week
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)☆237Updated 7 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆154Updated 4 months ago
- ☆187Updated this week
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆231Updated 5 months ago
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu…☆129Updated last week
- ☆107Updated 11 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆102Updated 4 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆103Updated last month
- This is the official implementation for ControlVAR.☆94Updated 2 months ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need☆173Updated 2 months ago
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".☆134Updated 2 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆284Updated 2 weeks ago
- ☆123Updated this week
- ☆100Updated 7 months ago
- [ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxi…☆227Updated 9 months ago
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi…☆149Updated 3 months ago
- Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆114Updated 2 weeks ago
- Code for FreeScale, a tuning-free method for higher-resolution visual generation☆114Updated last month
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆482Updated 8 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆135Updated 2 weeks ago
- VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)☆187Updated 10 months ago
- [CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation☆276Updated 9 months ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆155Updated 10 months ago
- Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)☆135Updated 8 months ago
- Scaling Diffusion Transformers with Mixture of Experts☆252Updated 5 months ago
- [ICLR25] High-performance Image Tokenizers for VAR and AR☆194Updated this week