AI4Bharat / DocSim

Synthetically generate random text document images with ground-truth

☆11

Alternatives and similar repositories for DocSim:

Users that are interested in DocSim are comparing it to the libraries listed below

githubharald / WordDetectorNN
Detect handwritten words (neural network based).
☆70Updated 3 years ago
Deepayan137 / Adapting-OCR
Pytorch implementation of our paper: Adapting OCR with Limited Labels
☆60Updated last year
crazycloud / Handwritten-text-Detection-Detectron2
Handwritten text detection in document images using Detectron2
☆20Updated 3 years ago
kartikgill / Easter2
Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION
☆79Updated 2 years ago
Theivaprakasham / layoutlmv3
This Repository consists of all my experiments performed on LayoutLMv3 model.
☆29Updated 2 years ago
amzn / convolutional-handwriting-gan
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation (CVPR20)
☆272Updated 4 years ago
Praneet9 / Representation-Learning-for-Information-Extraction
Pytorch implementation of Paper by Google Research - Representation Learning for Information Extraction from Form-like Documents.
☆100Updated 2 years ago
qurator-spk / sbb_textline_detection
Detect textlines in document images
☆92Updated 11 months ago
SamSamhuns / donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
☆8Updated last year
herobd / Visual-Template-Free-Form-Parsing
Code for my ICDAR paper "Deep Visual Template-Free Form Parsing"
☆88Updated 3 years ago
Shakleen / Python-Document-Detector
A simple document detector in python3
☆51Updated 2 years ago
kayoyin / DirtyDocuments
☆22Updated 5 years ago
bhattbhavesh91 / DocTR-OCR-tutorial
This repository contains a notebook to demonstrate the power of Document Text Recognition (DocTR) library
☆12Updated 3 years ago
qurator-spk / eynollah
Document Layout Analysis
☆371Updated this week
janzd / CRNN
Convolutional recurrent neural network for scene text recognition or OCR in Keras
☆125Updated 4 years ago
abdoelsayed2016 / TNCR_Dataset
Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…
☆68Updated last year
sparkfish / shabby-pages
ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for us…
☆57Updated last month
githubharald / WordDetector
Detect handwritten words (classic image processing based method).
☆272Updated last year
AyanGadpal / TextTron-Lightweight-text-detector
TextTron is a simple light-weight image processing based text detector for document images.
☆52Updated 4 years ago
mdv3101 / CDeCNet
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images
☆133Updated 3 months ago
vloison / Handwritten_Text_Recognition
☆34Updated 4 years ago
rossumai / docile
DocILE: Document Information Localization and Extraction Benchmark
☆125Updated 11 months ago
iitb-research-code / indic-trocr
Transformer OCR for Indian Languages
☆10Updated last year
sciencefictionlab / chargrid-pytorch
Pytorch Implementation of Chargrid Paper (https://arxiv.org/abs/1809.08799)
☆27Updated 3 years ago
zzzDavid / ICDAR-2019-SROIE
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
☆394Updated 4 years ago
rsommerfeld / trocr
Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…
☆200Updated 3 months ago
tobiasvanderwerff / full-page-handwriting-recognition
Unofficial implementation of the paper "Full Page Handwriting Recognition via Image to Sequence Extraction" by Singh et al. (2021).
☆53Updated 2 years ago
NjoyimPeguy / ICDAR-2019-RRC-SROIE
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
☆32Updated 2 years ago
Psarpei / Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
☆274Updated 2 years ago
sambalshikhar / Document-Image-Classification-with-Intra-Domain-Transfer-Learning-and-Stacked-Generalization-of-Deep
RVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the …
☆18Updated 5 years ago