rsommerfeld / trocrLinks

Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models".

☆208

Alternatives and similar repositories for trocr

Users that are interested in trocr are comparing it to the libraries listed below

Sorting:

phamquiluan / jdeskew
ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
☆141Updated 2 months ago
arvindrajan92 / DTrOCR
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition
☆179Updated last week
him4318 / Transformer-ocr
Handwritten text recognition using transformers.
☆158Updated 11 months ago
qurator-spk / eynollah
Document Layout Analysis
☆378Updated last month
LynnHaDo / Document-Layout-Analysis
Object Detection Model for Scanned Documents
☆93Updated 4 months ago
JPLeoRX / detectron2-publaynet
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
☆48Updated 2 years ago
rossumai / docile
DocILE: Document Information Localization and Extraction Benchmark
☆130Updated last year
sparkfish / augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
☆436Updated 3 weeks ago
Layout-Parser / layout-model-training
The scripts for training Detectron2-based Layout Models on popular layout analysis datasets
☆212Updated last year
DS4SD / DocLayNet
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
☆354Updated 2 years ago
Psarpei / Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
☆277Updated 2 years ago
kartikgill / Easter2
Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION
☆80Updated 2 years ago
tomassosorio / OCR_tablenet
TableNet Implementation on Pytorch
☆148Updated 2 years ago
jpWang / LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆351Updated 2 years ago
fcakyon / craft-text-detector
Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector
☆264Updated 3 years ago
xinke-wang / OCRDatasets
A collection of OCR-related datasets
☆175Updated 2 years ago
SCUT-DLVCLab / Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
☆194Updated 4 months ago
Vishnunkumar / craft_hw_ocr
Recognition of handwritten text using CRAFT text detection and TrOCR
☆26Updated 2 years ago
czczup / FAST
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
☆203Updated last month
githubharald / HTRPipeline
Detect and read handwritten words on scanned pages.
☆122Updated last year
shabie / docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆280Updated 2 years ago
jainammm / TableNet
Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Docum…
☆329Updated 2 years ago
qurator-spk / sbb_binarization
Document Image Binarization
☆77Updated 8 months ago
phamquiluan / PubLayNet
ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...
☆182Updated 4 years ago
andreagemelli / doc2graph
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
☆126Updated 2 years ago
sbrunner / deskew
Library used to deskew a scanned document
☆473Updated last week
ayanban011 / SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆73Updated 10 months ago
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆145Updated 2 months ago
dmitrijsk / AttentionHTR
Attention-based sequence-to-sequence model for handwritten word recognition
☆60Updated 9 months ago
fh2019ustc / DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
☆394Updated 3 weeks ago