YutingLi0606 / HTR-VT
(Pattern Recognition) PyTorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”
☆56 Updated last week
Alternatives and similar repositories for HTR-VT:
Users who are interested in HTR-VT are comparing it to the repositories listed below.
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023 ☆73 Updated 8 months ago
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024 ☆39 Updated 4 months ago
- ☆76 Updated this week
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting. ☆59 Updated 9 months ago
- ☆12 Updated 8 months ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting ☆32 Updated 2 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023 ☆23 Updated last year
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi… ☆24 Updated 6 months ago
- Basic HTR concepts/modules to boost performance ☆27 Updated 3 months ago
- [ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer ☆68 Updated 6 months ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024) ☆48 Updated 9 months ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models ☆32 Updated last month
- [TPAMI'24] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation ☆247 Updated 3 months ago
- ☆23 Updated this week
- ☆23 Updated last year
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control ☆64 Updated last month
- Create handwritten word embeddings from a text recognition Seq2Seq system. ☆10 Updated 2 years ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024. ☆35 Updated 6 months ago
- VimTS: A Unified Video and Image Text Spotter ☆76 Updated 4 months ago
- UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models ☆221 Updated 3 weeks ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression ☆62 Updated 2 weeks ago
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer ☆76 Updated 11 months ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation ☆70 Updated 6 months ago
- ☆19 Updated 3 years ago
- ☆68 Updated 8 months ago
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022. ☆28 Updated 2 years ago
- ☆40 Updated 8 months ago
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding" ☆105 Updated 2 months ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation ☆13 Updated 6 months ago