CompVis / zigma
A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)
☆290 Updated last month
Alternatives and similar repositories for zigma:
Users who are interested in zigma compare it to the libraries listed below
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis ☆173 Updated 6 months ago
- Scalable Diffusion Models with State Space Backbone ☆149 Updated 10 months ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023) ☆545 Updated 8 months ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆176 Updated 3 months ago
- Transformer-Mamba Diffusion Models ☆94 Updated 6 months ago
- XQ-GAN🚀: An Open-source Image Tokenization Framework for Autoregressive Generation ☆178 Updated last month
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook ☆73 Updated 6 months ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model ☆393 Updated 2 months ago
- [ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners" ☆154 Updated 10 months ago
- ☆81 Updated last year
- Code for Fast Training of Diffusion Models with Masked Transformers ☆385 Updated 8 months ago
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation ☆480 Updated 2 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation ☆112 Updated 2 weeks ago
- Open source implementation of "Vision Transformers Need Registers" ☆162 Updated 2 months ago
- [NeurIPS 2024] Official implementation of "BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models… ☆101 Updated last month
- Official PyTorch implementation of Video Probabilistic Diffusion Models in Projected Latent Space (CVPR 2023). ☆314 Updated 8 months ago
- This is the official implementation for ControlVAR. ☆88 Updated last month
- [NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models" ☆325 Updated 3 months ago
- This repo contains the code for 1D tokenizer and generator ☆645 Updated this week
- Official pytorch repository for “Guidance with Spherical Gaussian Constraint for Conditional Diffusion” ☆51 Updated 6 months ago
- Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures ☆395 Updated 2 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna… ☆166 Updated last year
- The official implementation of "[MASK] is All You Need" ☆104 Updated last month
- Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think ☆791 Updated last month
- ☆69 Updated 2 months ago
- This is the official code release for our work, Denoising Vision Transformers. ☆351 Updated 2 months ago
- Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024. ☆305 Updated 6 months ago
- 🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)" ☆225 Updated 2 years ago
- [CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation ☆273 Updated 8 months ago
- [ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxi… ☆223 Updated 8 months ago