westlake-repl / LeanVAE
[ICCV2025] LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
☆49 · Updated last month
Alternatives and similar repositories for LeanVAE
Users who are interested in LeanVAE are comparing it to the repositories listed below.
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model ☆173 · Updated 5 months ago
- (SRA) No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves ☆90 · Updated 2 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024] ☆83 · Updated 11 months ago
- Transition Models ☆128 · Updated last week
- [ECCV2024] "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu ☆55 · Updated 10 months ago
- This is the official implementation for ControlVAR. ☆122 · Updated 10 months ago
- ☆41 · Updated 4 months ago
- SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation (CVPR 2024) ☆67 · Updated 2 months ago
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers ☆363 · Updated 3 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆146 · Updated 8 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models ☆42 · Updated last year
- PyTorch implementation of One-step Diffusion with Distribution Matching Distillation ☆38 · Updated last year
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation ☆89 · Updated 3 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆122 · Updated 10 months ago
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation) ☆98 · Updated 4 months ago
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens" ☆88 · Updated 6 months ago
- ☆25 · Updated 7 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT ☆138 · Updated 6 months ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling. ☆154 · Updated 3 months ago
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models ☆81 · Updated last month
- Autoregressive Image Generation with Randomized Parallel Decoding ☆77 · Updated 6 months ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆218 · Updated last year
- [ICML 2025] Official code for the paper 'DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space' ☆62 · Updated 2 months ago
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models ☆75 · Updated last year
- ☆74 · Updated 7 months ago
- [ArXiv 2025] Follow-Your-Shape: This repo is the official implementation of "Follow-Your-Shape: Shape-Aware Image Editing via Trajectory… ☆50 · Updated 2 months ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba" ☆140 · Updated 9 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations" ☆38 · Updated 7 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆174 · Updated 6 months ago
- FACM: Flow-Anchored Consistency Models ☆121 · Updated 2 months ago