ERNIE-Image is an open text-to-image generation model developed by the ERNIE-Image team at Baidu. It is built on a single-stream Diffusion Transformer (DiT). With only 8B DiT parameters, it reaches state-of-the-art performance among open-weight text-to-image models.
☆470 · Apr 17, 2026 · Updated last month
Alternatives and similar repositories for ERNIE-Image
Users who are interested in ERNIE-Image are comparing it to the libraries listed below.
- CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation ☆51 · Apr 9, 2026 · Updated 2 months ago
- ☆54 · Jun 3, 2026 · Updated last week
- Wrapper of DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion, run in diffusers mode ☆30 · Feb 26, 2026 · Updated 3 months ago
- Unofficial ComfyUI implementation of UniWorld. ☆21 · Jun 10, 2025 · Updated last year
- Chroma key (green screen removal) algorithms with Python ☆11 · Jul 14, 2024 · Updated last year
- ☆14 · Nov 24, 2023 · Updated 2 years ago
- Media sentence repeater (mp3, mp4) with srt subtitle file, based on Filebrowser, for language learning, offline app, tts, and a to-do-l… ☆14 · May 28, 2026 · Updated 2 weeks ago
- Neural Reflectance Field from Shading and Shadow under a Fixed Viewpoint ☆16 · Aug 8, 2022 · Updated 3 years ago
- [CVPR 2026 Oral] Official implementation for ChordEdit: One-Step Low-Energy Transport for Image Editing ☆146 · May 13, 2026 · Updated 3 weeks ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture." ☆62 · Dec 16, 2025 · Updated 5 months ago
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026) ☆49 · Apr 19, 2026 · Updated last month
- A framework for camera-controllable image editing using unified geometric guidance and video models. ☆66 · Apr 28, 2026 · Updated last month
- Official implementation for SSDD Single-Step Diffusion Decoder for Efficient Image Tokenization. ☆63 · Mar 16, 2026 · Updated 2 months ago
- Official implementation of CGF paper Two-step Training: Adjustable Sketch Colorization via Reference Image and Text Tag ☆21 · Aug 31, 2024 · Updated last year
- Deep learning chat based on DL4J ☆11 · Mar 31, 2017 · Updated 9 years ago
- Fast as Hamster | Stable Hamster | Stable Diffusion ☆15 · Aug 26, 2024 · Updated last year
- Basel morphable face model mesh and texture generator using GPU. ☆14 · Sep 14, 2020 · Updated 5 years ago
- CUDA 12.8 accelerated prebuilt wheel of llama-cpp-python with full Gemma 3 model support for Windows 10/11 (x64). Built by boneylizard. ☆17 · Aug 24, 2025 · Updated 9 months ago
- This repository is the official implementation of FLUX-CustomID. It is capable of generating images based on your face image at a level e… ☆25 · Nov 13, 2024 · Updated last year
- A prototype app that controls an iPhone X with face gestures, using Apple's ARKit ☆14 · May 6, 2019 · Updated 7 years ago
- InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity ☆12 · Jan 3, 2026 · Updated 5 months ago
- Custom nodes by IAMCCS for ComfyUI, including the WANAnimate LoRA Loader Fix and cinematic extensions. ☆101 · Updated this week
- ☆25 · Jun 5, 2026 · Updated last week
- Unofficial Windows wheel package for the Nunchaku (SVDQuant) library. ☆14 · Mar 9, 2025 · Updated last year
- DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning ☆24 · Apr 17, 2021 · Updated 5 years ago
- dataset ☆19 · Jul 20, 2023 · Updated 2 years ago
- Make HiDream-I1 available in ComfyUI. ☆10 · Apr 14, 2025 · Updated last year
- NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation ☆330 · Jan 9, 2026 · Updated 5 months ago
- UniMesh: Unifying 3D Mesh Understanding and Generation ☆57 · May 8, 2026 · Updated last month
- Re:Pitch is a WebApp for speeding up and resampling audio files. It is written in JS and uses only the WebAudioAPI. ☆16 · May 28, 2026 · Updated 2 weeks ago
- ☆22 · Jan 19, 2026 · Updated 4 months ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing ☆48 · Apr 15, 2026 · Updated last month
- A collection of Bootstrap 4 snippets for Brackets. Should contribute to develop !!! ☆13 · May 23, 2024 · Updated 2 years ago
- ☆13 · Jul 20, 2022 · Updated 3 years ago
- A full CUDA implementation of the DCNv2 (deformable convolution) forward pass, with no dependency on cuTorch (THC). ☆10 · Dec 9, 2019 · Updated 6 years ago
- ☆23 · Jun 13, 2025 · Updated 11 months ago
- ☆46 · Oct 28, 2025 · Updated 7 months ago
- Today I Learned ☆10 · Mar 30, 2021 · Updated 5 years ago
- Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation ☆41 · Mar 6, 2026 · Updated 3 months ago