thu-ml/unidiffuser

thu-ml / unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

☆1,486

Alternatives and similar repositories for unidiffuser

Users who are interested in unidiffuser are comparing it to the repositories listed below.

baofff / U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
☆1,107 · Mar 25, 2023 · Updated 3 years ago
SHI-Labs / Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
☆1,334 · Aug 10, 2023 · Updated 2 years ago
sail-sg / MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
☆595 · Apr 23, 2024 · Updated 2 years ago
openai / consistency_models
Official repo for consistency models.
☆6,491 · Mar 22, 2024 · Updated 2 years ago
facebookresearch / DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,681 · May 31, 2024 · Updated 2 years ago
openai / consistencydecoder
Consistency Distilled Diff VAE
☆2,213 · Nov 7, 2023 · Updated 2 years ago
TencentARC / T2I-Adapter
T2I-Adapter
☆3,806 · Jun 21, 2024 · Updated 2 years ago
PixArt-alpha / PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,298 · Oct 31, 2024 · Updated last year
LuChengTHU / dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)
☆1,850 · Feb 6, 2024 · Updated 2 years ago
google / prompt-to-prompt
☆3,456 · May 14, 2024 · Updated 2 years ago
baaivision / Emu
Emu Series: Generative Multimodal Models from BAAI
☆1,776 · Jan 12, 2026 · Updated 6 months ago
willisma / SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
☆1,186 · Dec 22, 2025 · Updated 6 months ago
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,959 · Aug 15, 2024 · Updated last year
mit-han-lab / fastcomposer
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
☆715 · Jan 10, 2025 · Updated last year
ziqihuangg / ReVersion
[SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images
☆504 · Oct 7, 2025 · Updated 9 months ago
csyxwei / ELITE
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)
☆541 · Jan 8, 2024 · Updated 2 years ago
ali-vilab / composer
Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"
☆1,560 · Dec 26, 2023 · Updated 2 years ago
Zhendong-Wang / Prompt-Diffusion
Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"
☆414 · Mar 25, 2024 · Updated 2 years ago
salesforce / LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
☆11,251 · Jun 2, 2026 · Updated last month
openai / guided-diffusion
☆7,403 · Jul 2, 2024 · Updated 2 years ago
gligen / GLIGEN
Open-Set Grounded Text-to-Image Generation
☆2,226 · Mar 6, 2024 · Updated 2 years ago
mkshing / svdiff-pytorch
Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning"
☆386 · Jan 24, 2024 · Updated 2 years ago
adobe-research / custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
☆1,976 · May 24, 2026 · Updated last month
luosiallen / latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
☆4,615 · Jun 14, 2024 · Updated 2 years ago
Picsart-AI-Research / PAIR-Diffusion
[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
☆521 · Apr 2, 2024 · Updated 2 years ago
omerbt / MultiDiffusion
Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …
☆1,062 · Sep 21, 2023 · Updated 2 years ago
YangLing0818 / RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,838 · Feb 1, 2025 · Updated last year
NUS-HPC-AI-Lab / VideoSys
VideoSys: An easy and efficient system for video generation
☆2,025 · Aug 27, 2025 · Updated 10 months ago
dbolya / tomesd
Speed up Stable Diffusion with this one simple trick!
☆1,405 · Nov 29, 2023 · Updated 2 years ago
AILab-CVC / SEED
Official implementation of SEED-LLaMA (ICLR 2024).
☆642 · Sep 21, 2024 · Updated last year
AILab-CVC / VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
☆5,066 · Jan 9, 2026 · Updated 6 months ago
ShihaoZhaoZSH / Uni-ControlNet
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
☆670 · Jul 17, 2024 · Updated 2 years ago
CompVis / latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
☆14,108 · Feb 29, 2024 · Updated 2 years ago
google-research / magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
☆1,002 · Jan 17, 2024 · Updated 2 years ago
cloneofsimo / lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
☆7,547 · Mar 22, 2024 · Updated 2 years ago
TencentARC / SEED-Voken
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,016 · Nov 25, 2025 · Updated 7 months ago
zai-org / ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
☆1,694 · Oct 29, 2025 · Updated 8 months ago
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,247 · Feb 16, 2025 · Updated last year
Anima-Lab / MaskDiT
Code for Fast Training of Diffusion Models with Masked Transformers
☆428 · May 15, 2024 · Updated 2 years ago