koninik / DiffusionPen
Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024
☆46Updated 5 months ago
Alternatives and similar repositories for DiffusionPen:
Users that are interested in DiffusionPen are comparing it to the libraries listed below
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆80Updated 9 months ago
- ☆14Updated 9 months ago
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…☆17Updated 6 months ago
- ☆80Updated last month
- ☆23Updated last month
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆71Updated 3 weeks ago
- ☆92Updated 8 months ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…☆49Updated 9 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆60Updated 10 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆34Updated last month
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆35Updated 7 months ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆50Updated 10 months ago
- ☆95Updated last year
- The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.☆140Updated 2 years ago
- The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing"☆60Updated 2 months ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆34Updated 2 weeks ago
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆128Updated this week
- ☆24Updated last year
- ☆56Updated last year
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆42Updated 8 months ago
- UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models☆222Updated 2 months ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression☆63Updated last month
- Evaluating GPT-4o's image generation and editing ability in OCR tasks.☆39Updated last week
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild☆35Updated last year
- ☆40Updated 9 months ago
- ☆15Updated last year
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆136Updated last month
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆53Updated 10 months ago
- FETNet: Feature Erasing and Transferring Network for Scene Text Removal☆25Updated last year