NJU-PCALab / TextCrafter
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
☆81 · Updated 3 months ago
Alternatives and similar repositories for TextCrafter
Users who are interested in TextCrafter are comparing it to the repositories listed below.
- RepText: Rendering Visual Text via Replicating 🔥 ☆138 · Updated 5 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ☆160 · Updated 4 months ago
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation ☆44 · Updated 7 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis" ☆73 · Updated 2 months ago
- [IJCAI 2025 (Oral)] Official implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion … ☆99 · Updated 6 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset ☆233 · Updated 3 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆132 · Updated 9 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework. ☆96 · Updated 4 months ago
- ☆51 · Updated 11 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search ☆99 · Updated last month
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with… ☆150 · Updated 2 months ago
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-form u… ☆209 · Updated 6 months ago
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video … ☆155 · Updated 7 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline ☆119 · Updated 7 months ago
- [ICCV 2025] CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation ☆117 · Updated 3 months ago
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations" ☆84 · Updated 5 months ago
- Subjects200K dataset ☆123 · Updated 10 months ago
- Blending Custom Photos with Video Diffusion Transformers ☆48 · Updated 9 months ago
- [ICCV 2025] DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models (official implementation) ☆145 · Updated 5 months ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series. ☆104 · Updated 6 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆63 · Updated 6 months ago
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation ☆200 · Updated 9 months ago
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing ☆23 · Updated this week
- Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill… ☆86 · Updated 4 months ago
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing". ☆162 · Updated 11 months ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion … ☆160 · Updated last year
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆69 · Updated 4 months ago
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting ☆120 · Updated 10 months ago
- ☆27 · Updated last year
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback ☆166 · Updated 3 weeks ago