baidu/ERNIE-Image

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/baidu/ERNIE-Image)

baidu / ERNIE-Image

ERNIE-Image is an open text-to-image generation model developed by the ERNIE-Image team at Baidu. It is built on a single-stream Diffusion Transformer (DiT), with only 8B DiT parameters, it reaches state-of-the-art performance among open-weight text-to-image models.

☆493

Alternatives and similar repositories for ERNIE-Image

Users that are interested in ERNIE-Image are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WithNucleusAI / Nucleus-Image
View on GitHub
NucleusImage training recipe
☆83Apr 9, 2026Updated 3 months ago
boogu-project / Boogu-Image
View on GitHub
Boogu-Image-0.1 is an Apache-2.0 open-source image generation and editing model family that delivers near-closed-source performance with …
☆833Updated this week
ATH-MaaS / Ovis-Image
View on GitHub
Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stri…
☆319May 15, 2026Updated 2 months ago
meituan-longcat / LongCat-Image
View on GitHub
☆709May 9, 2026Updated 2 months ago
inclusionAI / TwinFlow
View on GitHub
[ICLR 2026] Taming large-scale few-step training with self-adversarial flows! 👏🏻
☆536Feb 24, 2026Updated 5 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
FireRedTeam / FireRed-Image-Edit
View on GitHub
FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instructi…
☆1,319Apr 3, 2026Updated 3 months ago
Tencent-Hunyuan / HunyuanImage-2.1
View on GitHub
HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation
☆673Oct 14, 2025Updated 9 months ago
ernie-research / NAVA
View on GitHub
Official Code of NAVA: Native Audio-Visual Alignment for Generation.
☆214Jun 30, 2026Updated 3 weeks ago
TencentYoutuResearch / T2I-L2P
View on GitHub
Code for "L2P: Unlocking Latent Potential for Pixel Generation"
☆179Jul 11, 2026Updated 2 weeks ago
Tongyi-MAI / Z-Image
View on GitHub
☆11,786Feb 9, 2026Updated 5 months ago
ideogram-oss / ideogram4
View on GitHub
Ideogram 4: Open image model at the forefront of design
☆2,583Jun 30, 2026Updated 3 weeks ago
jmliu206 / SeFi-Image
View on GitHub
☆144Updated this week
kandinskylab / kandinsky-5
View on GitHub
Kandinsky 5.0: A family of diffusion models for Video & Image generation
☆799Jul 5, 2026Updated 2 weeks ago
nv-tlabs / PiD
View on GitHub
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion
☆982Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Correr-Zhou / OmniShow
View on GitHub
[ICML 2026] ByteDance's All-in-One Video Generation Model for Human-Object Interaction Video Generation
☆459May 19, 2026Updated 2 months ago
UnicomAI / LeMiCa
View on GitHub
[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
☆122Jun 22, 2026Updated last month
QwenLM / Qwen-Image
View on GitHub
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
☆8,164Feb 10, 2026Updated 5 months ago
bytedance / Lance
View on GitHub
A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.
☆1,291Jul 14, 2026Updated last week
QwenLM / Qwen-Image-Bench
View on GitHub
☆128Jun 18, 2026Updated last month
Lakonik / LakonLab
View on GitHub
Official implementation of AsymFlow, pi-Flow, GMFlow
☆452Jul 14, 2026Updated last week
vita-epfl / RDM
View on GitHub
☆79Jul 3, 2026Updated 3 weeks ago
HiDream-ai / HiDream-O1-Image
View on GitHub
☆1,475Jun 22, 2026Updated last month
GordonChen19 / Prompt-Relay
View on GitHub
An inference-time, plug-and-play method for temporal control in multi-event generation
☆185Apr 26, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
facebookresearch / tuna-2
View on GitHub
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆739Updated this week
krea-ai / krea-2
View on GitHub
Official inference code for Krea 2
☆660Updated this week
vvvvvjdy / D-OPSD
View on GitHub
Official Repo of "D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models"
☆289May 22, 2026Updated 2 months ago
black-forest-labs / flux2
View on GitHub
Official inference repo for FLUX.2 models
☆2,545Mar 12, 2026Updated 4 months ago
OpenSenseNova / SenseNova-U1
View on GitHub
SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles
☆4,352Jul 16, 2026Updated last week
Francis-Rings / FlashPortrait
View on GitHub
[CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length vide…
☆479Feb 21, 2026Updated 5 months ago
vvvvvjdy / dmdr
View on GitHub
[ECCV 2026] Official Code of "Distribution Matching Distillation Meets Reinforcement Learning"
☆285Feb 1, 2026Updated 5 months ago
GAIR-NLP / daVinci-MagiHuman
View on GitHub
☆2,101Apr 11, 2026Updated 3 months ago
Tencent-Hunyuan / HunyuanImage-3.0
View on GitHub
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
☆3,199Jun 23, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
PKU-YuanGroup / Edit-R1
View on GitHub
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
☆295Jan 24, 2026Updated 6 months ago
PKU-YuanGroup / Helios
View on GitHub
Helios: Real Real-Time Long Video Generation Model
☆1,999Jun 10, 2026Updated last month
aistudynow / Z-Image-Turbo-Lora-Stack-V4
View on GitHub
☆22Mar 6, 2026Updated 4 months ago
limuloo / RefineAnything
View on GitHub
☆223Apr 23, 2026Updated 3 months ago
ByteDance-Seed / SeedVR
View on GitHub
Repo for SeedVR2 (ICLR2026) & SeedVR (CVPR2025 Highlight)
☆1,281Jan 27, 2026Updated 5 months ago
jd-opensource / JoyAI-Image
View on GitHub
JoyAI-Image is the unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image ed…
☆2,241Jul 17, 2026Updated last week
MC-E / InstructX
View on GitHub
☆86Oct 10, 2025Updated 9 months ago