arvindrajan92 / DTrOCR

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition

☆179

Alternatives and similar repositories for DTrOCR

Users who are interested in DTrOCR are comparing it to the repositories listed below.


xinke-wang / OCRDatasets
A collection of OCR-related datasets
☆175 Updated 2 years ago
czczup / FAST
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
☆203 Updated last month
Swall0w / dtrocr
☆62 Updated last year
sparkfish / augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
☆436 Updated 3 weeks ago
D641593 / MixNet
☆89 Updated 5 months ago
tanguymagne / UVDoc
Code for the paper "UVDoc: Neural Grid-based Document Unwarping"
☆148 Updated 11 months ago
phamquiluan / jdeskew
ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
☆141 Updated 2 months ago
SCUT-DLVCLab / Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
☆194 Updated 4 months ago
ZZZHANG-jx / Recommendations-Document-Image-Processing
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadowing, …
☆273 Updated last month
rsommerfeld / trocr
Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…
☆208 Updated 6 months ago
clovaai / synthtiger
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
☆537 Updated last year
fh2019ustc / DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
☆394 Updated 3 weeks ago
ayanban011 / SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆73 Updated 10 months ago
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆145 Updated 2 months ago
FelixHertlein / inv3d-model
Inference, training and evaluation code for our models from the paper "Inv3D: a high-resolution 3D invoice dataset for template-guided si…
☆48 Updated last year
roatienza / deep-text-recognition-benchmark
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
☆305 Updated last year
bytedance / SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
☆135 Updated 2 years ago
MaxKinny / TabRecSet
A large scale camera-taken table detection and recognition dataset.
☆132 Updated last week
LynnHaDo / Document-Layout-Analysis
Object Detection Model for Scanned Documents
☆93 Updated 4 months ago
VamosC / CLIP4STR
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
☆142 Updated 4 months ago
facebookresearch / MultiplexedOCR
Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR
☆80 Updated 2 years ago
google-research-datasets / hiertext
The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…
☆287 Updated 7 months ago
gmuffiness / CRAFT-train
CRAFT(Baek et al., 2019) model training code
☆46 Updated 11 months ago
mindspore-lab / mindocr
A toolbox of ocr models and algorithms based on MindSpore
☆276 Updated 3 months ago
roatienza / straug
Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labelin…
☆258 Updated last year
HCIILAB / Scene-Text-Recognition-Recommendations
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
☆346 Updated last year
naver-ai / trace
TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)
☆27 Updated last year
baudm / parseq
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
☆650 Updated last year
ViTAE-Transformer / DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…
☆273 Updated last month
felixdittrich92 / OnnxTR
OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR
☆135 Updated 3 weeks ago