bikash/DocumentUnderstanding

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bikash/DocumentUnderstanding)

bikash / DocumentUnderstanding

Research papers and code on information extraction from image/pdf

☆97

Alternatives and similar repositories for DocumentUnderstanding

Users that are interested in DocumentUnderstanding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dhavalpotdar / Graph-Convolution-on-Structured-Documents
View on GitHub
This repo contains code to convert Structured Documents to Graphs and implement a Graph Convolution Neural Network for node classificatio…
☆145Dec 8, 2022Updated 3 years ago
wenwenyu / PICK-pytorch
View on GitHub
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…
☆568Jul 25, 2024Updated last year
madhav1ag / CDeCNet
View on GitHub
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images
☆134Sep 11, 2025Updated 10 months ago
sachinraja13 / TabStructNet
View on GitHub
☆132Mar 24, 2023Updated 3 years ago
zzzDavid / ICDAR-2019-SROIE
View on GitHub
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
☆417Jul 20, 2020Updated 6 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
tiwaridipak103 / Table_extraction
View on GitHub
☆22Jun 22, 2026Updated 3 weeks ago
shahrukhqasim / TIES-2.0
View on GitHub
Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Recognition using Graph Neural Networks (2019)
☆276Nov 22, 2022Updated 3 years ago
antoinedelplace / Chargrid
View on GitHub
Extraction of meaningful instances from document images with a Chargrid model
☆34Aug 9, 2021Updated 4 years ago
BobLd / DocumentLayoutAnalysis
View on GitHub
Document Layout Analysis resources repos for development with PdfPig.
☆637Oct 1, 2023Updated 2 years ago
prohandler / GS-Bulk-Emails
View on GitHub
Google App Scripts that sends a number of emails from the specific number and that tracks the open status of each email
☆17Dec 11, 2024Updated last year
tstanislawek / awesome-document-understanding
View on GitHub
A curated list of resources for Document Understanding (DU) topic
☆1,525Jun 2, 2023Updated 3 years ago
ruifcruz / sroie-on-layoutlm
View on GitHub
☆42Feb 6, 2021Updated 5 years ago
herobd / Visual-Template-Free-Form-Parsing
View on GitHub
Code for my ICDAR paper "Deep Visual Template-Free Form Parsing"
☆89Jan 14, 2022Updated 4 years ago
Academic-Hammer / SciTSR
View on GitHub
Table structure recognition dataset of the paper: Complicated Table Structure Recognition
☆383Jul 7, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jainammm / TableNet
View on GitHub
Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Docum…
☆324Mar 25, 2023Updated 3 years ago
jaywalnut310 / linear-transformer-for-table-recognition
View on GitHub
code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)
☆22Jun 16, 2021Updated 5 years ago
beacandler / EATEN
View on GitHub
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction
☆183Dec 29, 2019Updated 6 years ago
rasmusbergpalm / attend-copy-parse
View on GitHub
Code for the paper attend, copy, parse - End-to-end information extraction from documents (https://arxiv.org/pdf/1812.07248.pdf)
☆13Jun 2, 2022Updated 4 years ago
rossumai / docile
View on GitHub
DocILE: Document Information Localization and Extraction Benchmark
☆149Jun 17, 2026Updated last month
vsymbol / CUTIE
View on GitHub
CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)
☆152Dec 8, 2022Updated 3 years ago
DevashishPrasad / CascadeTabNet
View on GitHub
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table …
☆1,549Aug 27, 2021Updated 4 years ago
doc-analysis / DocBank
View on GitHub
DocBank: A Benchmark Dataset for Document Layout Analysis
☆652Aug 12, 2024Updated last year
clovaai / spade
View on GitHub
☆82Jun 12, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
applicaai / lambert
View on GitHub
Publicly released code for the LAMBERT model
☆106Jun 14, 2021Updated 5 years ago
BordiaS / layoutlm
View on GitHub
☆97Jul 13, 2020Updated 6 years ago
sciencefictionlab / chargrid-pytorch
View on GitHub
Pytorch Implementation of Chargrid Paper (https://arxiv.org/abs/1809.08799)
☆27Mar 11, 2022Updated 4 years ago
thisisbhavin / graphicalForest
View on GitHub
Using the adjacency matrix and random forest get the Name, Address, Items, Prices, Grand total from all kind of invoices.
☆18Mar 8, 2020Updated 6 years ago
naiveHobo / InvoiceNet
View on GitHub
Deep neural network to extract intelligent information from invoice documents.
☆2,690May 3, 2024Updated 2 years ago
anisha2102 / docvqa
View on GitHub
Document Visual Question Answering
☆130Jul 30, 2020Updated 5 years ago
entropy2333 / awesome-key-information-extraction
View on GitHub
A curated list of papers about key information extraction.
☆107Jul 8, 2026Updated last week
phamquiluan / PubLayNet
View on GitHub
ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...
☆183May 11, 2021Updated 5 years ago
ZZR8066 / GraphDoc
View on GitHub
☆45Jul 18, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
usydnlp / vdoc
View on GitHub
☆15Sep 7, 2022Updated 3 years ago
TurkuNLP / ocr-correction
View on GitHub
Post-processing OCR errors with seq2seq models
☆28Jul 30, 2020Updated 5 years ago
hassan-mahmood / TIES_DataGeneration
View on GitHub
Dataset Generation Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Parsing using Graph Neural Networks (2019)
☆123Aug 27, 2020Updated 5 years ago
stuartemiddleton / glosat_table_dataset
View on GitHub
GloSAT Historical Measurement Table Dataset
☆11Dec 3, 2025Updated 7 months ago
poloclub / tsr-convstem
View on GitHub
High-Performance Transformers for Table Structure Recognition Need Early Convolutions
☆45Apr 21, 2026Updated 2 months ago
ibm-aur-nlp / PubLayNet
View on GitHub
☆1,051Jul 9, 2025Updated last year
kavishgambhir / xy-cut-tree
View on GitHub
Segmenting a given document using recursive xy-cut algorithm.
☆12Oct 9, 2018Updated 7 years ago