The official implementation of RS-STE proposed by our paper Recognition-Synergistic Scene Text Editing (CVPR 2025).
☆29Jul 15, 2025Updated 7 months ago
Alternatives and similar repositories for RS-STE
Users that are interested in RS-STE are comparing it to the libraries listed below
Sorting:
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆98Mar 16, 2025Updated 11 months ago
- ☆60Jul 25, 2023Updated 2 years ago
- Official implementation code of the paper <AnyText2: Visual Text Generation and Editing With Customizable Attributes>☆183Nov 26, 2025Updated 3 months ago
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆143Apr 11, 2025Updated 10 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆79Mar 24, 2025Updated 11 months ago
- [ICCV 2025] Official Implementation of RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model for Referring …☆18Jun 27, 2025Updated 8 months ago
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆19Mar 30, 2025Updated 11 months ago
- The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""☆98Oct 20, 2025Updated 4 months ago
- This is a data generator of SRNet which is the model of paper Editing Text in the wild.☆114Jan 19, 2023Updated 3 years ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…☆62Jul 4, 2024Updated last year
- Code Implementation of the Paper: EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering☆49Jun 16, 2025Updated 8 months ago
- More suitable IP-Adapter for the DiT architecture☆31Jul 5, 2024Updated last year
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆93Oct 24, 2024Updated last year
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆130Nov 18, 2024Updated last year
- [CVPRW23] Implementation of ''FishDreamer: Towards Fisheye Semantic Completion via Unified Image Outpainting and Segmentation''☆35Sep 5, 2023Updated 2 years ago
- The official repository of Real Text Manipulation (RTM)☆43Mar 18, 2025Updated 11 months ago
- Official implementation of “The Source Image is the Best Attention for Infrared and Visible Image Fusion”☆23Oct 16, 2025Updated 4 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆88Jul 11, 2024Updated last year
- A stable & generalizable GRPO method for AR image generation☆31Oct 1, 2025Updated 5 months ago
- [ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks☆30Feb 5, 2026Updated last month
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆20Sep 24, 2025Updated 5 months ago
- [ECCV 2024] Official code repository of paper titled "Efficient 3D-Aware Facial Image Editing Via Attribute-Specific Prompt Learning"☆10Aug 2, 2024Updated last year
- 我的小窝, 装修全纪录☆11Apr 19, 2021Updated 4 years ago
- 【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending☆14Jun 16, 2025Updated 8 months ago
- [ICCV 2025] "Player-Centric Multimodal Prompt Generation for Large Language Model Based Identity-Aware Basketball Video Captioning".☆18Dec 11, 2025Updated 2 months ago
- ☆22Feb 3, 2026Updated last month
- ☆13Jul 28, 2024Updated last year
- CVPR 2025 Accepted Papers☆24Dec 20, 2025Updated 2 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 4 months ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆18Oct 19, 2025Updated 4 months ago
- The official pytorch implementation of the paper PromptHSI: Universal Hyperspectral Image Restoration Framework for Composite Degradation…☆15Feb 21, 2026Updated 2 weeks ago
- ☆100Jan 3, 2024Updated 2 years ago
- ☆100Aug 1, 2024Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆46Apr 11, 2025Updated 10 months ago
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆216Apr 5, 2025Updated 11 months ago
- ☆12Oct 17, 2024Updated last year
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated last month
- ☆31Feb 18, 2026Updated 2 weeks ago
- Training LoRAs (Low-Rank Adaptations) for the black-forest-labs/FLUX.1-Fill-dev model.☆10Feb 22, 2025Updated last year