[ICCV2025]LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
β82Sep 8, 2025Updated 6 months ago
Alternatives and similar repositories for LeanVAE
Users that are interested in LeanVAE are comparing it to the libraries listed below
Sorting:
- [CVPR 2025π₯] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Modelβ198May 11, 2025Updated 9 months ago
- Explore how to get a VQ-VAE models efficiently!β68Jul 24, 2025Updated 7 months ago
- Sentence VAE using the Transformer encoder-decoder architecture.β12Nov 30, 2024Updated last year
- This is the official code for the paper "EGVD: Event-Guided Video Diffusion Model for Physically Realistic Large-Motion Frame Interpolatiβ¦β17May 14, 2025Updated 9 months ago
- Official PyTorch implementation of FlowMo.β114Apr 7, 2025Updated 11 months ago
- β27May 3, 2024Updated last year
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compressionβ77Jul 30, 2025Updated 7 months ago
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamicsβ71Jan 13, 2026Updated last month
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Spaceβ351Oct 5, 2025Updated 5 months ago
- Official repository for βPixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Lossββ210Feb 3, 2026Updated last month
- [IEEE TIP 2024] Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Modelβ36Apr 24, 2024Updated last year
- [ICLR 2026] Self-Representation Alignment for Diffusion Transformers (SRA)β113Feb 22, 2026Updated 2 weeks ago
- Text-based real image editing with stable diffusion modelsβ27Dec 19, 2022Updated 3 years ago
- β66Dec 10, 2023Updated 2 years ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Modelsβ287Dec 4, 2024Updated last year
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusioβ¦β100Feb 4, 2026Updated last month
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantizationβ55Sep 16, 2025Updated 5 months ago
- β32Sep 12, 2024Updated last year
- Official code for "Computationally-Efficient Neural Image Compression with Shallow Decoders", ICCV 2023β35Oct 15, 2024Updated last year
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspectiveβ79Oct 31, 2024Updated last year
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".β125Jun 18, 2025Updated 8 months ago
- Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".β37Apr 3, 2023Updated 2 years ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]β83Nov 17, 2024Updated last year
- [ECCV'24] Self-training Room Layout Estimation via Geometry-aware Ray-castingβ15Jan 20, 2025Updated last year
- Pure Java Llama2 inference with optional multi-GPU CUDA implementationβ13Sep 2, 2023Updated 2 years ago
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Modelsβ77Sep 11, 2024Updated last year
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generationβ95Dec 4, 2025Updated 3 months ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Needβ245Mar 11, 2025Updated 11 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesisβ86Jul 16, 2024Updated last year
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"β90Oct 12, 2024Updated last year
- Data analysis scripts for Pufferβ11Jun 4, 2025Updated 9 months ago
- β62Jul 1, 2025Updated 8 months ago
- [CVPR 2021] FMO Deblurring Benchmarkβ13Jan 12, 2022Updated 4 years ago
- β15Nov 4, 2025Updated 4 months ago
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAEβ395Jan 19, 2025Updated last year
- [ICCV 2025] GAS: Generative Avatar Synthesis from a Single Imageβ52Jan 1, 2026Updated 2 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carvingβ275Aug 4, 2025Updated 7 months ago
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.β99Nov 27, 2024Updated last year
- Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"β97Feb 12, 2024Updated 2 years ago