AI4Bharat / DocSim
Synthetically generate random text document images with ground-truth
☆12Updated 4 years ago
Alternatives and similar repositories for DocSim
Users who are interested in DocSim are comparing it to the libraries listed below.
- Checkbox Detection Model for Scanned Documents☆91Updated 11 months ago
- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition☆282Updated 3 years ago
- A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV☆129Updated 3 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆408Updated 5 years ago
- Handwritten text detection in document images using Detectron2☆21Updated 4 years ago
- ☆392Updated 2 years ago
- TableNet Implementation on Pytorch☆150Updated 3 years ago
- TextTron is a simple light-weight image processing based text detector for document images.☆53Updated 5 years ago
- Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Docum…☆325Updated 2 years ago
- Library used to deskew a scanned document☆498Updated this week
- Document Layout Analysis☆395Updated this week
- Comparison-of-OCR (KerasOCR, PyTesseract, EasyOCR)☆63Updated 3 years ago
- Pytorch implementation of our paper: Adapting OCR with Limited Labels☆62Updated 2 years ago
- BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on sc…☆113Updated 3 years ago
- Detect handwritten words (neural network based).☆73Updated 3 years ago
- CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images☆134Updated 4 months ago
- Unofficial implementation of the paper "Full Page Handwriting Recognition via Image to Sequence Extraction" by Singh et al. (2021).☆53Updated 3 years ago
- ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation (CVPR20)☆278Updated 5 years ago
- Document Image Binarization☆79Updated last year
- A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents usi…☆507Updated 2 years ago
- Pytorch Implementation of Chargrid Paper (https://arxiv.org/abs/1809.08799)☆27Updated 3 years ago
- ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...☆183Updated 4 years ago
- ☆1,000Updated last year
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆241Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆345Updated 2 years ago
- ☆63Updated 4 years ago
- Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector☆272Updated 3 years ago
- ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for us…☆62Updated 10 months ago
- A simple document detector in python3☆51Updated 2 years ago
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes☆503Updated 6 months ago