YutingLi0606 / HTR-VT
(Pattern Recognition) Pytorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”
☆51Updated this week
Alternatives and similar repositories for HTR-VT:
Users that are interested in HTR-VT are comparing it to the libraries listed below
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆36Updated 3 months ago
- ☆76Updated last year
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆73Updated 7 months ago
- ☆22Updated last week
- ☆12Updated 7 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆58Updated 8 months ago
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…☆24Updated 6 months ago
- ☆23Updated last year
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆10Updated 2 years ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆30Updated last month
- [ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer☆66Updated 5 months ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆37Updated 6 months ago
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"☆100Updated last month
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆46Updated 8 months ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆70Updated 5 months ago
- Basic HTR concepts/modules to boost performance☆25Updated 2 months ago
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…☆14Updated 4 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆63Updated 5 months ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆76Updated 2 years ago
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆58Updated last month
- Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆18Updated 2 weeks ago
- UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models☆217Updated last week
- Official implementation of High Fidelity Scene Text Synthesis.☆46Updated last month
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆31Updated 3 weeks ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆31Updated 5 months ago
- VimTS: A Unified Video and Image Text Spotter☆76Updated 3 months ago