westlake-repl / LeanVAEView external linksLinks
[ICCV2025]LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
β79Sep 8, 2025Updated 5 months ago
Alternatives and similar repositories for LeanVAE
Users that are interested in LeanVAE are comparing it to the libraries listed below
Sorting:
- [CVPR 2025π₯] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Modelβ194May 11, 2025Updated 9 months ago
- Explore how to get a VQ-VAE models efficiently!β67Jul 24, 2025Updated 6 months ago
- Sentence VAE using the Transformer encoder-decoder architecture.β12Nov 30, 2024Updated last year
- This is the official code for the paper "EGVD: Event-Guided Video Diffusion Model for Physically Realistic Large-Motion Frame Interpolatiβ¦β17May 14, 2025Updated 9 months ago
- Official PyTorch implementation of FlowMo.β110Apr 7, 2025Updated 10 months ago
- β27May 3, 2024Updated last year
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compressionβ76Jul 30, 2025Updated 6 months ago
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamicsβ71Jan 13, 2026Updated last month
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Spaceβ345Oct 5, 2025Updated 4 months ago
- [IEEE TIP 2024] Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Modelβ34Apr 24, 2024Updated last year
- An auxiliary project analysis of the characteristics of KV in DiT Attention.β32Nov 29, 2024Updated last year
- β66Dec 10, 2023Updated 2 years ago
- [ICLR 2026] Self-Representation Alignment for Diffusion Transformers (SRA)β111Jan 26, 2026Updated 2 weeks ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Modelsβ286Dec 4, 2024Updated last year
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusioβ¦β99Feb 4, 2026Updated last week
- Odysseus: Playground of LLM Sequence Parallelismβ79Jun 17, 2024Updated last year
- Official code for "Computationally-Efficient Neural Image Compression with Shallow Decoders", ICCV 2023β35Oct 15, 2024Updated last year
- β32Sep 12, 2024Updated last year
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspectiveβ79Oct 31, 2024Updated last year
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".β126Jun 18, 2025Updated 7 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]β84Nov 17, 2024Updated last year
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Modelsβ75Sep 11, 2024Updated last year
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generationβ95Dec 4, 2025Updated 2 months ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Needβ245Mar 11, 2025Updated 11 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesisβ86Jul 16, 2024Updated last year
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)β34Apr 9, 2022Updated 3 years ago
- β10Jan 25, 2026Updated 2 weeks ago
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model"β12May 26, 2024Updated last year
- Data analysis scripts for Pufferβ11Jun 4, 2025Updated 8 months ago
- β15Nov 4, 2025Updated 3 months ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Modelβ14Jul 31, 2025Updated 6 months ago
- MCFL for Pedestrian attribute recognitionβ13Jul 20, 2020Updated 5 years ago
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAEβ388Jan 19, 2025Updated last year
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carvingβ272Aug 4, 2025Updated 6 months ago
- [ICCV 2025] GAS: Generative Avatar Synthesis from a Single Imageβ52Jan 1, 2026Updated last month
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.β99Nov 27, 2024Updated last year
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantizationβ197Dec 18, 2025Updated last month
- Official Implementation of pMF https://arxiv.org/abs/2601.22158β91Feb 5, 2026Updated last week
- β66Jul 8, 2025Updated 7 months ago