Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
☆1,475May 31, 2023Updated 2 years ago
Alternatives and similar repositories for unidiffuser
Users who are interested in unidiffuser are comparing it to the libraries listed below.
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆1,097Mar 25, 2023Updated 2 years ago
- Official repo for consistency models.☆6,476Mar 22, 2024Updated last year
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,393May 31, 2024Updated last year
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023☆1,336Aug 10, 2023Updated 2 years ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆593Apr 23, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,281Oct 31, 2024Updated last year
- T2I-Adapter☆3,797Jun 21, 2024Updated last year
- Consistency Distilled Diff VAE☆2,209Nov 7, 2023Updated 2 years ago
- ☆3,441May 14, 2024Updated last year
- Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)☆1,822Feb 6, 2024Updated 2 years ago
- Emu Series: Generative Multimodal Models from BAAI☆1,768Jan 12, 2026Updated last month
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆1,102Dec 22, 2025Updated 2 months ago
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images☆506Oct 7, 2025Updated 4 months ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,560Dec 26, 2023Updated 2 years ago
- Open-Set Grounded Text-to-Image Generation☆2,196Mar 6, 2024Updated 2 years ago
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,615Jun 14, 2024Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆716Jan 10, 2025Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,936Aug 15, 2024Updated last year
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)☆1,971Dec 1, 2025Updated 3 months ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"☆413Mar 25, 2024Updated last year
- ☆7,306Jul 2, 2024Updated last year
- Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning"☆384Jan 24, 2024Updated 2 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,177Nov 18, 2024Updated last year
- Speed up Stable Diffusion with this one simple trick!☆1,403Nov 29, 2023Updated 2 years ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,844Feb 1, 2025Updated last year
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor☆521Apr 2, 2024Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆5,035Jan 9, 2026Updated last month
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆543Jan 8, 2024Updated 2 years ago
- VideoSys: An easy and efficient system for video generation☆2,016Aug 27, 2025Updated 6 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,252Feb 16, 2025Updated last year
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,641Oct 29, 2025Updated 4 months ago
- Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,058Sep 21, 2023Updated 2 years ago
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models☆669Jul 17, 2024Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers☆996Nov 25, 2025Updated 3 months ago
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer☆995Jan 17, 2024Updated 2 years ago
- High-Resolution Image Synthesis with Latent Diffusion Models☆13,864Feb 29, 2024Updated 2 years ago
- Official implementation of SEED-LLaMA (ICLR 2024).☆642Sep 21, 2024Updated last year
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆7,526Mar 22, 2024Updated last year
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability☆953Nov 11, 2023Updated 2 years ago