YutingLi0606 / HTR-VT
(Pattern Recognition) PyTorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”
☆62Updated last month
Alternatives and similar repositories for HTR-VT:
Users who are interested in HTR-VT compare it to the repositories listed below
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆46Updated 5 months ago
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…☆26Updated 7 months ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆70Updated 6 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆60Updated 9 months ago
- ☆80Updated last month
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆80Updated 9 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- ☆14Updated 9 months ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆50Updated 10 months ago
- ☆26Updated 2 months ago
- Evaluating GPT-4o's image generation and editing ability in OCR tasks.☆36Updated last week
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆34Updated 2 weeks ago
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer☆76Updated last year
- [IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation☆257Updated 2 weeks ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆33Updated 3 weeks ago
- [ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer☆70Updated 7 months ago
- ☆24Updated last year
- ☆23Updated last month
- Handwritten Text Recognition in Few-shot Scenario☆20Updated 2 years ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆42Updated 8 months ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…☆49Updated 9 months ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆35Updated 7 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆65Updated 7 months ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆27Updated last year
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆13Updated 7 months ago
- ☆29Updated 9 months ago
- Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆18Updated 2 months ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆53Updated 9 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆14Updated 4 months ago