PKU-YuanGroup / WF-VAE
Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
☆109Updated this week
Alternatives and similar repositories for WF-VAE:
Users that are interested in WF-VAE are comparing it to the libraries listed below
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu…☆124Updated 2 weeks ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆153Updated 9 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 6 months ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆45Updated 4 months ago
- ☆107Updated 10 months ago
- The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/☆106Updated 2 weeks ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆108Updated 2 months ago
- ☆66Updated 7 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆133Updated 7 months ago
- ArXiv paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆58Updated 3 months ago
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with…☆108Updated 3 months ago
- ☆221Updated 6 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆88Updated this week
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"☆64Updated 7 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆102Updated last month
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models☆87Updated 4 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆95Updated 5 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆95Updated 11 months ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆123Updated 3 months ago
- ☆41Updated last month
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆66Updated 3 weeks ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"☆67Updated last month
- Vico: Compositional Video Generation as Flow Equalization☆54Updated 2 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆59Updated this week
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation☆38Updated last year
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆47Updated 3 months ago
- [ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆72Updated last month
- Blending Custom Photos with Video Diffusion Transformers☆37Updated last week
- Training-Free Condition-Guided Text-to-Video Generation☆59Updated last year
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆33Updated last month