AIGText/Glyph-ByT5

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AIGText/Glyph-ByT5)

AIGText / Glyph-ByT5

[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering""

☆621

Alternatives and similar repositories for Glyph-ByT5

Users that are interested in Glyph-ByT5 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OPPO-Mente-Lab / GlyphDraw2
View on GitHub
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
☆88Jul 11, 2024Updated last year
AIGText / GlyphControl-release
View on GitHub
[NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"
☆238Jul 11, 2024Updated last year
tyxsspa / AnyText
View on GitHub
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
☆4,848Mar 7, 2025Updated last year
TencentQQGYLab / ELLA
View on GitHub
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
☆1,286Jul 17, 2024Updated last year
megvii-research / HiDiffusion
View on GitHub
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
☆842Jan 7, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Shakker-Labs / RepText
View on GitHub
RepText: Rendering Visual Text via Replicating 🔥
☆140Jun 7, 2025Updated 11 months ago
design-edit / DesignEdit
View on GitHub
[AAAI2025] DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework
☆367Dec 10, 2024Updated last year
ID-Animator / ID-Animator
View on GitHub
☆384Jun 6, 2024Updated last year
Tencent-Hunyuan / HunyuanDiT
View on GitHub
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
☆4,294Nov 27, 2025Updated 6 months ago
donahowe / AutoStudio
View on GitHub
[CVPRW 2026] AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
☆448Apr 13, 2025Updated last year
YangLing0818 / RPG-DiffusionMaster
View on GitHub
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,842Feb 1, 2025Updated last year
instantX-research / InstantStyle
View on GitHub
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
☆2,006Sep 18, 2024Updated last year
Alpha-VLLM / Lumina-T2X
View on GitHub
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,252Feb 16, 2025Updated last year
ali-vilab / In-Context-LoRA
View on GitHub
Official repository of In-Context LoRA for Diffusion Transformers
☆2,073Dec 20, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Kwai-Kolors / Kolors
View on GitHub
Kolors Team
☆4,612Nov 13, 2024Updated last year
ali-vilab / MimicBrush
View on GitHub
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
☆1,308Jun 15, 2024Updated last year
instantX-research / Regional-Prompting-FLUX
View on GitHub
Training-free Regional Prompting for Diffusion Transformers 🔥
☆697Nov 28, 2024Updated last year
PixArt-alpha / PixArt-alpha
View on GitHub
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,299Oct 31, 2024Updated last year
XLabs-AI / x-flux
View on GitHub
☆2,236Nov 8, 2024Updated last year
JIA-Lab-research / ControlNeXt
View on GitHub
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
☆1,645Sep 25, 2024Updated last year
Yuanshi9815 / OminiControl
View on GitHub
[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
☆1,910Jul 3, 2025Updated 10 months ago
kijai / ComfyUI-LuminaWrapper
View on GitHub
☆196Jul 31, 2024Updated last year
catcathh / UltraPixel
View on GitHub
Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
☆617Sep 27, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
google / style-aligned
View on GitHub
Official code for "Style Aligned Image Generation via Shared Attention"
☆1,320Dec 29, 2023Updated 2 years ago
ZYM-PKU / UDiffText
View on GitHub
[ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff…
☆234Feb 14, 2025Updated last year
TheMistoAI / ComfyUI-Anyline
View on GitHub
Anyline: A Fast, Accurate, and Detailed Line Detection Preprocessor
☆496Sep 5, 2025Updated 8 months ago
G-U-N / Be-Your-Outpainter
View on GitHub
[ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745
☆255Apr 19, 2025Updated last year
lllyasviel / LayerDiffuse
View on GitHub
Transparent Image Layer Diffusion using Latent Transparency
☆2,205Jun 16, 2024Updated last year
PixArt-alpha / PixArt-sigma
View on GitHub
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
☆1,922Oct 31, 2024Updated last year
tencent-ailab / IP-Adapter
View on GitHub
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
☆6,584Jun 28, 2024Updated last year
TencentARC / BrushNet
View on GitHub
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
☆1,732Dec 17, 2024Updated last year
jdh-algo / JoyType
View on GitHub
JoyType: A Robust Design for Multilingual Visual Text Creation
☆39Sep 21, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
LituRout / RF-Inversion
View on GitHub
Rectified Flow Inversion (RF-Inversion) - ICLR 2025
☆474Mar 19, 2025Updated last year
bowen-upenn / ControlText
View on GitHub
ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations
☆35Apr 3, 2025Updated last year
RockeyCoss / SPO
View on GitHub
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
☆270Apr 7, 2025Updated last year
TheMistoAI / MistoLine
View on GitHub
A Versatile and Robust SDXL-ControlNet Model for Adaptable Line Art Conditioning
☆555Jan 6, 2026Updated 4 months ago
HaozheLiu-ST / T-GATE
View on GitHub
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
☆418Feb 26, 2025Updated last year
stepfun-ai / Step1X-Edit
View on GitHub
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…
☆2,212Apr 29, 2026Updated 3 weeks ago
bghira / SimpleTuner
View on GitHub
A general fine-tuning kit geared toward image/video/audio diffusion models.
☆2,833Updated this week