YutingLi0606 / HTR-VTLinks

(Pattern Recognition) Pytorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”

☆85

Alternatives and similar repositories for HTR-VT

Users that are interested in HTR-VT are comparing it to the libraries listed below

Sorting:

guoxy25 / Ocean-OCR
☆36Updated 5 months ago
ymy-k / Hi-SAM
[IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
☆290Updated last month
koninik / DiffusionPen
Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024
☆52Updated 8 months ago
aimagelab / VATr
☆82Updated 4 months ago
ayanban011 / SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆73Updated 10 months ago
koninik / WordStylist
Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023
☆81Updated last year
georgeretsi / HTR-best-practices
Basic HTR concepts/modules to boost performance
☆32Updated 7 months ago
whlscut / DocLayLLM
[CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
☆17Updated 4 months ago
EDM-Research / VATr-pp
☆14Updated last year
qingzhenduyu / ICAL
Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…
☆27Updated 11 months ago
shannanyinxiang / UPOCR
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
☆56Updated last year
dali92002 / SSL-OCR
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆26Updated 2 years ago
MelosY / CAM
☆25Updated last year
arvindrajan92 / DTrOCR
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition
☆179Updated last week
Yuliang-Liu / VimTS
VimTS: A Unified Video and Image Text Spotter
☆77Updated 8 months ago
yufanchen96 / RoDLA
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
☆35Updated 3 months ago
ispamm / NAF-DPM
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement
☆45Updated 11 months ago
mxin262 / Bridging-Text-Spotting
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
☆66Updated last year
xinke-wang / OCRDatasets
A collection of OCR-related datasets
☆177Updated 2 years ago
dailenson / One-DM
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
☆420Updated 3 weeks ago
LayTextLLM / LayTextLLM
☆95Updated 6 months ago
ZZZHANG-jx / DocRes
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
☆465Updated 2 weeks ago
Token-family / TokenFD
[ICCV2025] A Token-level Text Image Foundation Model for Document Understanding
☆109Updated 2 weeks ago
nttmdlab-nlp / InstructDoc
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
☆161Updated last year
xhli-git / DocSAM
☆15Updated 3 months ago
yeungchenwa / HDR
[AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents
☆85Updated 3 months ago
dali92002 / DocEnTR
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
☆166Updated 6 months ago
aimagelab / HWD
☆24Updated 4 months ago
PriNing / ODM
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
☆38Updated 3 months ago
koninik / HTG_evaluation
Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…
☆17Updated 9 months ago