rsommerfeld / trocr
Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models".
☆179Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for trocr
- Handwritten text recognition using transformers.☆153Updated 3 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆127Updated 4 months ago
- Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation☆164Updated 7 months ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆161Updated 2 months ago
- ☆155Updated last year
- DocILE: Document Information Localization and Extraction Benchmark☆117Updated 5 months ago
- A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition☆81Updated 2 months ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆344Updated 2 years ago
- CRAFT(Baek et al., 2019) model training code☆42Updated 2 months ago
- ☆75Updated last year
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆128Updated 11 months ago
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes☆353Updated last month
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆115Updated last year
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆47Updated last year
- Object Detection Model for Scanned Documents☆82Updated last year
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆202Updated last year
- Document Image Binarization☆73Updated 3 weeks ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆265Updated last year
- This Repository consists of all my experiments performed on LayoutLMv3 model.☆28Updated 2 years ago
- The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…☆264Updated 2 months ago
- Recognition of handwritten text using CRAFT text detection and TrOCR☆25Updated last year
- Detect textlines in document images☆90Updated 5 months ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆77Updated last year
- CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images☆131Updated 2 years ago
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆98Updated 11 months ago
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, cand you can get the same (even better) result compared w…☆41Updated 4 months ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆70Updated last month
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆41Updated this week
- CVPR 2022: Table Structure Recognition☆39Updated 2 years ago
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images In modern times, more a…☆46Updated 2 years ago