PKU-YuanGroup / WF-VAELinks
[CVPR 2025π₯] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
β180Updated 6 months ago
Alternatives and similar repositories for WF-VAE
Users that are interested in WF-VAE are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation)β101Updated 5 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generationβ125Updated 11 months ago
- This is the official implementation for ControlVAR.β122Updated 11 months ago
- Transition Modelsβ132Updated last month
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiTβ148Updated 3 weeks ago
- (SRA) No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselvesβ92Updated 3 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-projectβ178Updated 7 months ago
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Modelsβ75Updated last year
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Modelβ68Updated 6 months ago
- Training-Free Condition-Guided Text-to-Video Generationβ60Updated 3 weeks ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Modelsβ42Updated last year
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representationsβ147Updated 8 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"β105Updated last year
- Pixel-Space Generative Modelsβ274Updated 6 months ago
- Implementation of paper EditCLIP: Representation Learning for Image Editing (ICCV 2025)β31Updated 4 months ago
- CCEdit: Creative and Controllable Video Editing via Diffusion Modelsβ113Updated last year
- Code for D-DiTβ51Updated 7 months ago
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Modelsβ303Updated 6 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generationβ108Updated last month
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Modelsβ84Updated 2 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151β85Updated 6 months ago
- PyTorch implementation of One-step Diffusion with Distribution Matching Distillationβ44Updated last year
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"β277Updated 6 months ago
- [ICLR2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflectioβ¦β94Updated 8 months ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"β141Updated 10 months ago
- β78Updated 8 months ago
- [ICCV2025]LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Modelsβ54Updated 2 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformersβ73Updated 3 months ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLMβ69Updated 3 months ago
- ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Modelsβ185Updated 2 months ago