PKU-YuanGroup / WF-VAE
[CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
☆167 Updated 4 months ago
Alternatives and similar repositories for WF-VAE
Users who are interested in WF-VAE are comparing it to the repositories listed below.
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation) ☆97 Updated 3 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆123 Updated 9 months ago
- This is the official implementation for ControlVAR. ☆120 Updated 9 months ago
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models ☆75 Updated last year
- (SRA) No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves ☆85 Updated last month
- CCEdit: Creative and Controllable Video Editing via Diffusion Models ☆115 Updated last year
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆145 Updated 6 months ago
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model ☆57 Updated 4 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators" ☆106 Updated last year
- Training-Free Condition-Guided Text-to-Video Generation ☆61 Updated 5 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT ☆129 Updated 4 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆171 Updated 5 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation ☆106 Updated last week
- Pixel-Space Generative Models ☆270 Updated 4 months ago
- Transition Models ☆101 Updated last week
- Implementation of paper EditCLIP: Representation Learning for Image Editing (ICCV 2025) ☆26 Updated 2 months ago
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction" ☆249 Updated 4 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator ☆94 Updated last year
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151 ☆83 Updated 4 months ago
- Lumos Project: Frontier generative model research by Alibaba DAMO Academy, including Lumos-1, etc. ☆128 Updated last month
- The official implementation of "2025ICLR Dynamic Diffusion Transformer" and "2025ArXivDyDiT++: Dynamic Diffusion Transformers for Efficie… ☆41 Updated 5 months ago
- I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models ☆178 Updated last week
- [ICCV2025] LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models ☆42 Updated 2 months ago
- ☆119 Updated 3 weeks ago
- [ICLR 2024] Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach Link: https://arxiv.o… ☆81 Updated last year
- Code for D-DiT ☆46 Updated 5 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆165 Updated last week
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers ☆71 Updated last month
- [ICLR2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflectio… ☆83 Updated 6 months ago
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation ☆67 Updated 10 months ago