Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
☆1,476 · May 31, 2023 · Updated 2 years ago
Alternatives and similar repositories for unidiffuser
Users who are interested in unidiffuser are comparing it to the repositories listed below.
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models". ☆1,099 · Mar 25, 2023 · Updated 2 years ago
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023 ☆1,336 · Aug 10, 2023 · Updated 2 years ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023) ☆595 · Apr 23, 2024 · Updated last year
- Official repo for consistency models. ☆6,476 · Mar 22, 2024 · Updated last year
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ☆8,410 · May 31, 2024 · Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,281 · Oct 31, 2024 · Updated last year
- Consistency Distilled Diff VAE ☆2,212 · Nov 7, 2023 · Updated 2 years ago
- T2I-Adapter ☆3,802 · Jun 21, 2024 · Updated last year
- Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral) ☆1,826 · Feb 6, 2024 · Updated 2 years ago
- ☆3,443 · May 14, 2024 · Updated last year
- Emu Series: Generative Multimodal Models from BAAI ☆1,772 · Jan 12, 2026 · Updated 2 months ago
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers" ☆1,120 · Dec 22, 2025 · Updated 2 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,939 · Aug 15, 2024 · Updated last year
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral) ☆543 · Jan 8, 2024 · Updated 2 years ago
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention ☆716 · Jan 10, 2025 · Updated last year
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images ☆505 · Oct 7, 2025 · Updated 5 months ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions" ☆1,559 · Dec 26, 2023 · Updated 2 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligence ☆11,189 · Nov 18, 2024 · Updated last year
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models" ☆413 · Mar 25, 2024 · Updated last year
- ☆7,318 · Jul 2, 2024 · Updated last year
- Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning" ☆384 · Jan 24, 2024 · Updated 2 years ago
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023) ☆1,971 · Dec 1, 2025 · Updated 3 months ago
- Open-Set Grounded Text-to-Image Generation ☆2,210 · Mar 6, 2024 · Updated 2 years ago
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference ☆4,615 · Jun 14, 2024 · Updated last year
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor ☆521 · Apr 2, 2024 · Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,843 · Feb 1, 2025 · Updated last year
- Speed up Stable Diffusion with this one simple trick! ☆1,402 · Nov 29, 2023 · Updated 2 years ago
- VideoSys: An easy and efficient system for video generation ☆2,020 · Aug 27, 2025 · Updated 6 months ago
- Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" … ☆1,057 · Sep 21, 2023 · Updated 2 years ago
- Official implementation of SEED-LLaMA (ICLR 2024). ☆642 · Sep 21, 2024 · Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models ☆5,037 · Jan 9, 2026 · Updated 2 months ago
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models ☆669 · Jul 17, 2024 · Updated last year
- High-Resolution Image Synthesis with Latent Diffusion Models ☆13,906 · Feb 29, 2024 · Updated 2 years ago
- Using Low-rank adaptation to quickly fine-tune diffusion models. ☆7,529 · Mar 22, 2024 · Updated last year
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer ☆995 · Jan 17, 2024 · Updated 2 years ago
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆998 · Nov 25, 2025 · Updated 3 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,253 · Feb 16, 2025 · Updated last year
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation ☆1,650 · Oct 29, 2025 · Updated 4 months ago
- Code for Fast Training of Diffusion Models with Masked Transformers ☆422 · May 15, 2024 · Updated last year