yyyyyxie/textflux

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yyyyyxie/textflux)

yyyyyxie / textflux

TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis

☆98

Alternatives and similar repositories for textflux

Users that are interested in textflux are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EzioBy / Calligrapher
View on GitHub
Calligrapher: Freestyle Text Image Customization
☆296Sep 3, 2025Updated 10 months ago
hxixixh / amo-release
View on GitHub
Official implementation for CVPR 2025 paper "AMO Sampler: Enhancing Text Rendering with Overshooting"
☆30May 3, 2025Updated last year
yyyyyxie / DNTextSpotter
View on GitHub
[ACMMM 2024]: Official implementation of the paper "DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training"
☆38Jan 14, 2026Updated 6 months ago
tyxsspa / AnyText2
View on GitHub
Official implementation code of the paper <AnyText2: Visual Text Generation and Editing With Customizable Attributes>
☆211Nov 26, 2025Updated 7 months ago
AMAP-ML / FluxText
View on GitHub
Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"
☆729Nov 24, 2025Updated 7 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Shakker-Labs / RepText
View on GitHub
RepText: Rendering Visual Text via Replicating 🔥
☆139Jun 7, 2025Updated last year
songyiren725 / EasyText
View on GitHub
Code Implementation of the Paper: EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering
☆56Jun 16, 2025Updated last year
CodeGoat24 / DreamText
View on GitHub
[CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.
☆82Mar 24, 2025Updated last year
Zhenhang-Li / GlyphOnly
View on GitHub
【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending
☆14Jun 16, 2025Updated last year
weichaozeng / TextCtrl
View on GitHub
[2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
☆105Mar 16, 2025Updated last year
bowen-upenn / ControlText
View on GitHub
ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations
☆35Apr 3, 2025Updated last year
shannanyinxiang / UPOCR
View on GitHub
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
☆69Jun 6, 2024Updated 2 years ago
ZhengyaoFang / RS-STE
View on GitHub
The official implementation of RS-STE proposed by our paper Recognition-Synergistic Scene Text Editing (CVPR 2025).
☆33Jun 4, 2026Updated last month
ZYM-PKU / UTDesign
View on GitHub
A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images
☆15Jan 6, 2026Updated 6 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
shuyansy / Visual-Text-Processing-survey
View on GitHub
The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""
☆103Oct 20, 2025Updated 9 months ago
CIawevy / TextPecker
View on GitHub
[CVPR2026] TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering
☆53Updated this week
ecnuljzhang / brush-your-text
View on GitHub
☆99Jan 3, 2024Updated 2 years ago
alimama-creative / PosterMaker
View on GitHub
PosterMaker [CVPR 2025] https://poster-maker.github.io/
☆159Nov 12, 2025Updated 8 months ago
wangyuxin87 / Tampered_sroie
View on GitHub
The tampered text detection dataset
☆22Aug 23, 2023Updated 2 years ago
SaturMars / ComfyUI-QwenImageLoraConverter
View on GitHub
This is a ComfyUI custom node used to convert Qwen-Image LoRA files trained on the ModelScope platform to a format that ComfyUI can recog…
☆30Aug 9, 2025Updated 11 months ago
xzhe-Vision / PersonaMagic
View on GitHub
☆16Jul 29, 2025Updated 11 months ago
YaoShunyu19 / MDIQA
View on GitHub
☆19Sep 4, 2025Updated 10 months ago
Ephemeral182 / PosterCraft
View on GitHub
[ICLR'26] Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework
☆541Jan 27, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
HM-RunningHub / ComfyUI_RH_ICCustom
View on GitHub
This is a ComfyUI plug-in for TencentARC/IC-Custom
☆36Sep 3, 2025Updated 10 months ago
OPPO-Mente-Lab / GlyphDraw2
View on GitHub
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
☆87Jul 11, 2024Updated 2 years ago
QY-H00 / Conceptrol
View on GitHub
Conceptrol: Concept Control of Zero-shot Personalized Image Generation
☆47Mar 27, 2025Updated last year
ZYM-PKU / UDiffText
View on GitHub
[ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff…
☆236Feb 14, 2025Updated last year
AIFSH / MultiTalk2GP-ComfyUI
View on GitHub
a ComfyUI custom node for MultiTalk
☆33Jun 18, 2025Updated last year
QijiTec / ComfyUI-REDBAGEL-dfloat11
View on GitHub
A ComfyUI extention for BAGEL dfloat11 Quantized wrapper (Unified Model for Multimodal Understanding and Generation)
☆25May 30, 2025Updated last year
yichengup / ComfyUI_ycHunyuanVideoFoley
View on GitHub
HunyuanVideoFoley generates SFX audio to match your video and text prompt
☆25Sep 2, 2025Updated 10 months ago
zxYin / ConsistEdit_Code
View on GitHub
[SIGGRAPH Asia 2025] Official Implementation of "ConsistEdit: Highly Consistent and Precise Training-free Visual Editing"
☆73Apr 8, 2026Updated 3 months ago
eternal8080 / MV-MATH
View on GitHub
Description for MV-MATH
☆15Jul 20, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
PKU-YuanGroup / UniWorld
View on GitHub
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
☆883Dec 23, 2025Updated 6 months ago
sooyeon-go / eye_for_an_eye
View on GitHub
Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models
☆33Mar 9, 2026Updated 4 months ago
lrzjason / T2ITrainer
View on GitHub
Practice Code for text to image trainer
☆560Feb 27, 2026Updated 4 months ago
ymy-k / Hi-SAM
View on GitHub
[IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
☆365May 30, 2025Updated last year
qcf-568 / OSTF
View on GitHub
[AAAI2025] Revisiting Tampered Scene Text Detection in the Era of Generative AI
☆72Jun 7, 2026Updated last month
piscesbody / Comfyui_Object_Detect_QWen_VL
View on GitHub
☆24Sep 20, 2025Updated 10 months ago
wd1511 / Awesome-Layout-Generation
View on GitHub
Awesome Layout Generation
☆85Apr 10, 2025Updated last year