PixArt-alpha/PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

☆1,933

Alternatives and similar repositories for PixArt-sigma

Users that are interested in PixArt-sigma are comparing it to the libraries listed below.

PixArt-alpha / PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,299 · Oct 31, 2024 · Updated last year
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,248 · Feb 16, 2025 · Updated last year
TencentQQGYLab / ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
☆1,285 · Jul 17, 2024 · Updated 2 years ago
Tencent-Hunyuan / HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
☆4,292 · Nov 27, 2025 · Updated 8 months ago
NUS-HPC-AI-Lab / VideoSys
VideoSys: An easy and efficient system for video generation
☆2,025 · Aug 27, 2025 · Updated 11 months ago
instantX-research / InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
☆2,011 · Sep 18, 2024 · Updated last year
Vchitect / Latte
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
☆1,948 · Oct 30, 2025 · Updated 8 months ago
G-U-N / Phased-Consistency-Model
[NeurIPS 2024] Boosting the performance of consistency models with PCM!
☆520 · Dec 11, 2024 · Updated last year
megvii-research / HiDiffusion
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
☆841 · Jan 7, 2026 · Updated 6 months ago
bytedance / res-adapter
[AAAI 2025] Official codes of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".
☆760 · Apr 27, 2025 · Updated last year
tianweiy / DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,415 · Mar 5, 2025 · Updated last year
tencent-ailab / IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
☆6,647 · Jun 28, 2024 · Updated 2 years ago
lllyasviel / LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
☆2,218 · Jun 16, 2024 · Updated 2 years ago
ChenyangSi / FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
☆1,899 · Dec 24, 2024 · Updated last year
Doubiiu / DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
☆3,007 · Sep 8, 2024 · Updated last year
facebookresearch / DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,693 · May 31, 2024 · Updated 2 years ago
lllyasviel / IC-Light
More relighting!
☆8,477 · Feb 20, 2025 · Updated last year
magic-research / piecewise-rectified-flow
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
☆538 · Sep 8, 2025 · Updated 10 months ago
Kwai-Kolors / Kolors
Kolors Team
☆4,609 · Nov 13, 2024 · Updated last year
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960 · Aug 15, 2024 · Updated last year
city96 / ComfyUI_ExtraModels
Support for miscellaneous image models. Currently supports: DiT, PixArt, HunYuanDiT, MiaoBi, and a few VAEs.
☆537 · Dec 17, 2024 · Updated last year
guoyww / AnimateDiff
Official implementation of AnimateDiff.
☆12,195 · Jul 31, 2024 · Updated last year
XLabs-AI / x-flux
☆2,230 · Nov 8, 2024 · Updated last year
JIA-Lab-research / ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
☆1,643 · Sep 25, 2024 · Updated last year
openai / consistencydecoder
Consistency Distilled Diff VAE
☆2,213 · Nov 7, 2023 · Updated 2 years ago
willisma / SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
☆1,193 · Dec 22, 2025 · Updated 7 months ago
ali-vilab / VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
☆3,155 · Jan 10, 2025 · Updated last year
luosiallen / latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
☆4,614 · Jun 14, 2024 · Updated 2 years ago
YangLing0818 / RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,840 · Feb 1, 2025 · Updated last year
showlab / X-Adapter
[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
☆770 · Aug 14, 2024 · Updated last year
mira-space / Mira
☆362 · Oct 21, 2024 · Updated last year
Alpha-VLLM / Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…
☆646 · Oct 16, 2025 · Updated 9 months ago
TencentARC / T2I-Adapter
T2I-Adapter
☆3,803 · Jun 21, 2024 · Updated 2 years ago
gnobitab / InstaFlow
InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
☆1,407 · Jun 7, 2024 · Updated 2 years ago
jabir-zheng / TCD
Official Repository of the paper "Trajectory Consistency Distillation"
☆361 · Apr 28, 2024 · Updated 2 years ago
horseee / DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
☆970 · Jun 27, 2024 · Updated 2 years ago
NVlabs / Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
☆8,609 · Updated this week
IDKiro / sdxs
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions
☆664 · May 27, 2024 · Updated 2 years ago
PKU-YuanGroup / Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
☆12,155 · Mar 8, 2026 · Updated 4 months ago