EriCongMa / awesome-transformer-ocr
This repository is created to share current progress of transformer based optical character recognition(OCR). Welcome to share~
☆54Updated last year
Alternatives and similar repositories for awesome-transformer-ocr:
Users that are interested in awesome-transformer-ocr are comparing it to the libraries listed below
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆77Updated 3 years ago
- swin-transformer custom for OCR☆114Updated last year
- A curated list of papers about key information extraction.☆91Updated 3 months ago
- Table Structure Recognition☆69Updated 2 years ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Updated 2 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆120Updated last year
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆42Updated last year
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆185Updated last month
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated last year
- DocILE: Document Information Localization and Extraction Benchmark☆123Updated 10 months ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆139Updated last year
- CRAFT(Baek et al., 2019) model training code☆46Updated 7 months ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆57Updated 6 months ago
- Official implementation for Dessurt☆58Updated 2 years ago
- ☆43Updated 2 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆82Updated last year
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆40Updated last year
- ☆81Updated last month
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆36Updated last year
- Distorted Document Images dataset (DDI-100).☆135Updated 2 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆70Updated 6 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆78Updated 9 months ago
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆23Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆270Updated 2 years ago
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆83Updated 2 years ago
- A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research w…☆85Updated 4 months ago
- A large scale camera-taken table detection and recognition dataset.☆123Updated last year
- ☆80Updated 3 weeks ago