AILab-CVC / CV-VAE
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
☆267 Updated 3 months ago
Alternatives and similar repositories for CV-VAE:
Users who are interested in CV-VAE are comparing it to the repositories listed below.
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation ☆252 Updated last week
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆406 Updated last week
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆137 Updated 3 weeks ago
- ☆189 Updated last month
- The code of our work "Golden Noise for Diffusion Models: A Learning Framework". ☆142 Updated 3 weeks ago
- VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE ☆295 Updated last month
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu… ☆137 Updated 2 weeks ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆201 Updated last month
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆109 Updated last month
- Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing" ☆108 Updated last year
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing". ☆141 Updated 2 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024) ☆239 Updated 8 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope… ☆240 Updated 6 months ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation" ☆130 Updated 5 months ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion … ☆158 Updated 11 months ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need ☆193 Updated this week
- [ICLR2025] ☆138 Updated last month
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024) ☆491 Updated 9 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆78 Updated 3 weeks ago
- Code for FreeScale, a tuning-free method for higher-resolution visual generation ☆116 Updated this week
- [CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation ☆279 Updated 10 months ago
- GenEval: An object-focused framework for evaluating text-to-image alignment ☆192 Updated last week
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models ☆161 Updated 5 months ago
- Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization ☆185 Updated 2 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆139 Updated last month
- Code repository for T2V-Turbo and T2V-Turbo-v2 ☆290 Updated last month
- [CVPR 2025] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models ☆434 Updated this week
- This is the official implementation for ControlVAR. ☆96 Updated 3 months ago
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models ☆203 Updated last month