PeterrrrLi / ResNet-Transformer-OCR-PytorchLinks

ResNet for License Plate Detection & Multi-Head Self Attention Transformer for OCR

☆16

Alternatives and similar repositories for ResNet-Transformer-OCR-Pytorch

Users that are interested in ResNet-Transformer-OCR-Pytorch are comparing it to the libraries listed below

Sorting:

husterpzh / PSSR
Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration （CVPR2023）"
☆10Updated last year
cxfyxl / VIPTR
☆40Updated 11 months ago
wzx99 / CLIPOCR
☆38Updated last year
zhaominyiz / STIRER
STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023
☆13Updated 6 months ago
xhli-git / DocSAM
☆13Updated 2 months ago
MelosY / CAM
☆25Updated last year
CyrilSterling / LPV
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆27Updated last year
ku21fan / CLL-STR
Cross-lingual learning in scene text recognition (ICASSP2024)
☆16Updated 8 months ago
liuzhuang1024 / SAM
Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition (ICDAR 2023)
☆15Updated last year
yflv-yanxia / Papers
☆23Updated 6 months ago
irisXcoding / DocReal
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction
☆23Updated 2 years ago
mxin262 / ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆76Updated last year
whlscut / DocLayLLM
[CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
☆15Updated 3 months ago
SCUT-DLVCLab / OCR-Reasoning
[arXiv: 2505.17163] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
☆53Updated last month
Gyann-z / FDP
☆12Updated 2 months ago
beargolden / DP-LinkNet
DP-LinkNet: A convolutional network for historical document image binarization
☆23Updated 4 years ago
DCGM / SoftCTC
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
☆19Updated 2 years ago
PriNing / ODM
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
☆38Updated 2 months ago
shannanyinxiang / SPTS
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
☆142Updated last year
HCIILAB / LAST
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Updated last year
SJTU-DeepVisionLab / PosFormer
[ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer
☆80Updated 2 months ago
lancercat / OSOCR
☆10Updated last year
lancercat / VSDF
☆24Updated last year
ThunderVVV / RCLSTR
Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`
☆17Updated last year
wk-ff / GTC
reimplement of "GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition"
☆16Updated 4 years ago
jfkuang / CFAM
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆29Updated 2 years ago
ZeningLin / PEneo
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
☆34Updated 2 months ago
GbotHQ / ocr-dataset-rendering
☆28Updated last year
HCIILAB / Text-Image-Augmentation
Geometric Augmentation for Text Image
☆9Updated 5 years ago
XiiZhao / cbn.pytorch
Official PyTorch implementation of "CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection"
☆21Updated last year