AI4Bharat / DocSim
Synthetically generate random text document images with ground-truth
☆11Updated 3 years ago
Alternatives and similar repositories for DocSim:
Users that are interested in DocSim are comparing it to the libraries listed below
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆8Updated last year
- Detect handwritten words (neural network based).☆67Updated 2 years ago
- Pytorch implementation of our paper: Adapting OCR with Limited Labels☆59Updated last year
- A simple document detector in python3☆50Updated 2 years ago
- Unofficial implementation of the paper "Full Page Handwriting Recognition via Image to Sequence Extraction" by Singh et al. (2021).☆52Updated 2 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated last year
- ☆20Updated 5 years ago
- Close-Domain fine-tuning for table detection☆72Updated 2 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- Document Image Enhancement with GANs - TPAMI journal☆192Updated last year
- Detect the tables in a form and extract the tables as well as the cells of the tables.☆62Updated 4 years ago
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer☆27Updated 3 years ago
- Quicksign OCRized Text Dataset (QS-OCR)☆44Updated 5 years ago
- Detect textlines in document images☆91Updated 8 months ago
- A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping☆105Updated 2 years ago
- This repository contains a notebook to demonstrate the power of Document Text Recognition (DocTR) library☆12Updated 3 years ago
- TableNet Implementation on Pytorch☆147Updated 2 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆389Updated 4 years ago
- CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images☆133Updated 3 weeks ago
- ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...☆177Updated 3 years ago
- ☆62Updated 3 years ago
- ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for us…☆55Updated this week
- Checkbox Detection Model for Scanned Documents☆61Updated last year
- Working codes for project☆23Updated last year
- This repository contains a 403 images dataset for table detection in documents.☆83Updated 6 years ago
- Recognition of handwritten text using CRAFT text detection and TrOCR☆25Updated 2 years ago
- ☆129Updated last year
- Detectron2 for Document Layout Analysis☆185Updated 6 months ago
- Code for my ICDAR paper "Deep Visual Template-Free Form Parsing"☆88Updated 3 years ago
- sambalshikhar / Document-Image-Classification-with-Intra-Domain-Transfer-Learning-and-Stacked-Generalization-of-DeepRVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the …☆18Updated 5 years ago