YutingLi0606 / HTR-VT
(Pattern Recognition) Pytorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”
☆40Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for HTR-VT
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…☆18Updated 3 months ago
- [ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer☆58Updated 2 months ago
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆68Updated 4 months ago
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer☆72Updated 7 months ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆33Updated 3 months ago
- ☆74Updated 11 months ago
- [TPAMI'24] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation☆211Updated last week
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆50Updated 5 months ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆12Updated 2 months ago
- ☆18Updated this week
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆39Updated 5 months ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆28Updated 2 months ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆70Updated 2 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"☆81Updated 2 months ago
- Official implementation of High Fidelity Scene Text Synthesis.☆36Updated this week
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆28Updated 7 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆60Updated 2 months ago
- ☆10Updated 4 months ago
- Basic HTR concepts/modules to boost performance☆21Updated 4 months ago
- Object Recognition as Next Token Prediction (CVPR 2024 Highlight)☆161Updated last month
- ☆22Updated 9 months ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression☆56Updated 2 months ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆73Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆20Updated 3 months ago
- ☆59Updated 5 months ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆20Updated 7 months ago
- The Learnable Typewriter: A Generative Approach to Text Line Analysis☆28Updated 3 weeks ago
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)☆52Updated 5 months ago
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"☆17Updated last year