ERNIE-Image is an open text-to-image generation model developed by the ERNIE-Image team at Baidu. It is built on a single-stream Diffusion Transformer (DiT), with only 8B DiT parameters, it reaches state-of-the-art performance among open-weight text-to-image models.
☆223Apr 16, 2026Updated this week
Alternatives and similar repositories for ERNIE-Image
Users that are interested in ERNIE-Image are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- "Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning" by Yue Duan (AAAI 2024…☆13Nov 20, 2025Updated 4 months ago
- Chroma key (green screen removal) algorithms with Python☆11Jul 14, 2024Updated last year
- ☆14Nov 24, 2023Updated 2 years ago
- Neural Reflectance Field from Shading and Shadow under a Fixed Viewpoint☆16Aug 8, 2022Updated 3 years ago
- One Diffusion model implementation base on LibTorch☆13Mar 22, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- CUDA 12.8 accelerated prebuilt wheel of llama-cpp-python with full Gemma 3 model support for Windows 10/11 (x64). Built by boneylizard.☆19Aug 24, 2025Updated 7 months ago
- InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity☆12Jan 3, 2026Updated 3 months ago
- ☆12Jul 19, 2023Updated 2 years ago
- ☆12Mar 16, 2022Updated 4 years ago
- A Hierarchical Approach for Generating Descriptive Image Paragraphs☆10Mar 27, 2020Updated 6 years ago
- dataset☆19Jul 20, 2023Updated 2 years ago
- ☆13Jul 20, 2022Updated 3 years ago
- ☆23Jun 13, 2025Updated 10 months ago
- ☆45Oct 28, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- TorchRL is a C++ reinforcement library using PyTorch C++ backend LibTorch☆10Jul 20, 2022Updated 3 years ago
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆22Oct 28, 2025Updated 5 months ago
- JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers☆17Jul 21, 2025Updated 8 months ago
- [NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R…☆38Sep 27, 2025Updated 6 months ago
- 基于千问3 TTS实现的音色创造+音色克隆Web应用!☆23Jan 4, 2026Updated 3 months ago
- A volume rendering using GLSL☆10Apr 26, 2022Updated 3 years ago
- Generative model for 3D objects.☆18Aug 12, 2023Updated 2 years ago
- ICCV DeeperAction Challenge - Kinetics-TPS Challenge on Part-level Action Parsing and Action Recognition.☆14Jun 4, 2021Updated 4 years ago
- [TMM 2023] Language-Guided Face Animation by Recurrent StyleGAN-based Generator☆11Apr 23, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- C2-Matching for CUDA11☆11Aug 24, 2023Updated 2 years ago
- ☆21Jul 1, 2021Updated 4 years ago
- QVAC Fabric: cross-platform LLM inference and fine-tuning, optimized for edge devices and heterogenous GPUs☆88Apr 7, 2026Updated last week
- Source code for UP-Diff☆15Nov 26, 2024Updated last year
- UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions☆48Dec 16, 2025Updated 4 months ago
- ☆27Oct 30, 2025Updated 5 months ago
- a Dify plugin for LM Studio☆22Mar 18, 2025Updated last year
- ☆41Dec 3, 2025Updated 4 months ago
- Source code and model weights for the PGGAN model utilised for the paper: Evaluating the Clinical Realism of Synthetic Chest X-Rays Gener…☆12Mar 2, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.☆37Jan 16, 2026Updated 3 months ago
- Tensorflow implementation of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs☆15Apr 27, 2018Updated 7 years ago
- The Official PyTorch implementation of CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Dif…☆16Jan 14, 2025Updated last year
- ☆31Mar 5, 2025Updated last year
- Unofficial implementation of deepseek/Janus in ComfyUI.☆17Mar 12, 2025Updated last year
- 用open-cv检测物体的大小,是实时的☆17Jan 7, 2022Updated 4 years ago
- ☆20Mar 29, 2023Updated 3 years ago