bowen-upenn / ControlText
ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations
☆22 · Updated last month
Alternatives and similar repositories for ControlText
Users who are interested in ControlText are comparing it to the libraries listed below.
- RepText: Rendering Visual Text via Replicating 🔥 ☆73 · Updated 2 weeks ago
- JoyType: A Robust Design for Multilingual Visual Text Creation ☆33 · Updated 5 months ago
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation ☆38 · Updated last month
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis. ☆62 · Updated last month
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation ☆32 · Updated 6 months ago
- an unofficial implementation of dreamtuner ☆24 · Updated last year
- Official repo for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ☆108 · Updated last week
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations" ☆76 · Updated last month
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models ☆73 · Updated 10 months ago
- ☆29 · Updated 7 months ago
- ☆32 · Updated last year
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation". ☆55 · Updated 8 months ago
- More suitable IP-Adapter for the DiT architecture ☆29 · Updated 10 months ago
- ☆40 · Updated 7 months ago
- ☆49 · Updated 4 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis" ☆59 · Updated last month
- Official PyTorch Implementation of "Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generati… ☆26 · Updated last year
- Official repository of IDEA-Bench ☆34 · Updated 3 months ago
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models ☆67 · Updated last year
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. A model supporting free user input of control… ☆124 · Updated 10 months ago
- ☆67 · Updated last week
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral) ☆42 · Updated last month
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆59 · Updated 2 months ago
- EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing ☆24 · Updated last month
- ☆25 · Updated 9 months ago
- [CVPR2024] Content-Style Decoupling for Unsupervised Makeup Transfer without Generating Pseudo Ground Truth ☆75 · Updated 2 months ago
- ☆14 · Updated 4 months ago
- ☆38 · Updated 8 months ago
- ☆15 · Updated 2 months ago
- X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation ☆68 · Updated last month