VinAIResearch / DiMSUM
DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation (NeurIPS 2024)
☆18 Updated last month
Alternatives and similar repositories for DiMSUM:
Users who are interested in DiMSUM are comparing it to the repositories listed below.
- Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023) ☆88 Updated last year
- ☆82 Updated last year
- Official repository for "SODA: Bottleneck Diffusion Models for Representation Learning" ☆22 Updated 10 months ago
- ☆70 Updated 3 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders ☆99 Updated last month
- Official implementation of "Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive" (ICLR 2024) ☆52 Updated 4 months ago
- "FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching" FlowAR employs a simplest scale design and is compatible with an… ☆84 Updated last month
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision ☆68 Updated 7 months ago
- The official PyTorch implementation of Fast Diffusion Model ☆94 Updated last year
- Dataset Diffusion: Diffusion-based Synthetic Data Generation for Pixel-Level Semantic Segmentation (NeurIPS 2023) ☆113 Updated 4 months ago
- Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning" ☆80 Updated 10 months ago
- More dimensions = More fun ☆21 Updated 6 months ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024] ☆67 Updated last year
- Augmenting with Language-guided Image Augmentation (ALIA) ☆70 Updated last year
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model" ☆48 Updated 2 years ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number] ☆31 Updated this week
- LOCO-Edit ☆20 Updated 3 months ago
- ☆38 Updated 8 months ago
- The official implementation of "[MASK] is All You Need" ☆104 Updated last month
- Code for the paper "Do text-free diffusion models learn discriminative visual representations?" ☆21 Updated last year
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis" ☆40 Updated 7 months ago
- [ECCV 2024] Official repository of ECCV 2024 paper: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion M… ☆13 Updated 3 weeks ago
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models. ☆64 Updated 10 months ago
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis ☆173 Updated 7 months ago
- The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models". ☆47 Updated 9 months ago
- Transformer-Mamba Diffusion Models ☆95 Updated 7 months ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆178 Updated 4 months ago
- Personalized Representation from Personalized Generation ☆49 Updated last month
- Source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models" ☆64 Updated last week
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024 ☆59 Updated 7 months ago