SamSamhuns / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

☆8

Alternatives and similar repositories for donut:

Users who are interested in donut are comparing it to the libraries listed below.

mdv3101 / CDeCNet
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images
☆133 Updated 3 months ago
abdoelsayed2016 / TNCR_Dataset
Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…
☆68 Updated last year
phamquiluan / table-transformer
CVPR 2022: Table Structure Recognition
☆39 Updated 3 years ago
AyanGadpal / TextTron-Lightweight-text-detector
TextTron is a simple light-weight image processing based text detector for document images.
☆52 Updated 4 years ago
rnjtsh / graphical-object-detector
Graphical Object Detection in Document Images
☆26 Updated 4 years ago
herobd / dessurt
Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer
☆59 Updated 2 years ago
cndplab-founder / ctdar_measurement_tool
Evaluation Tool for the ICDAR 2019 Competition on Table Detection and Recognition
☆41 Updated 2 years ago
antoinedelplace / Chargrid
Extraction of meaningful instances from document images with a Chargrid model
☆34 Updated 3 years ago
machine-intelligence-laboratory / DDI-100
Distorted Document Images dataset (DDI-100).
☆137 Updated 2 years ago
herobd / FUDGE
Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"
☆33 Updated 3 years ago
JPLeoRX / detectron2-publaynet
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
☆48 Updated 2 years ago
dhavalpotdar / Graph-Convolution-on-Structured-Documents
This repo contains code to convert Structured Documents to Graphs and implement a Graph Convolution Neural Network for node classificatio…
☆145 Updated 2 years ago
bikash / DocumentUnderstanding
Research papers and code on information extraction from image/pdf
☆96 Updated 2 years ago
saifullah3396 / docxclassifier
☆17 Updated 9 months ago
rossumai / docile
DocILE: Document Information Localization and Extraction Benchmark
☆125 Updated 11 months ago
tomassosorio / OCR_tablenet
TableNet Implementation on Pytorch
☆147 Updated 2 years ago
furkanbiten / idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
☆103 Updated 2 years ago
shabie / docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆276 Updated 2 years ago
facebookresearch / MultiplexedOCR
Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR
☆80 Updated 2 years ago
clovaai / spade
☆81 Updated last year
phamquiluan / PubLayNet
ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...
☆179 Updated 3 years ago
jpWang / LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆347 Updated 2 years ago
entropy2333 / awesome-key-information-extraction
A curated list of papers about key information extraction.
☆93 Updated 4 months ago
herobd / Visual-Template-Free-Form-Parsing
Code for my ICDAR paper "Deep Visual Template-Free Form Parsing"
☆88 Updated 3 years ago
anisha2102 / docvqa
Document Visual Question Answering
☆116 Updated 4 years ago
YongWookHa / swin-transformer-ocr
swin-transformer custom for OCR
☆115 Updated last year
NjoyimPeguy / ICDAR-2019-RRC-SROIE
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
☆32 Updated 2 years ago
sachinraja13 / TabStructNet
☆129 Updated 2 years ago
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆141 Updated last year
hpanwar08 / detectron2
Detectron2 for Document Layout Analysis
☆187 Updated 9 months ago