jpWang / LiLTLinks

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

☆351

Alternatives and similar repositories for LiLT

Users that are interested in LiLT are comparing it to the libraries listed below

Sorting:

shabie / docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆281Updated 2 years ago
SCUT-DLVCLab / Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
☆196Updated 5 months ago
rossumai / docile
DocILE: Document Information Localization and Extraction Benchmark
☆131Updated last year
clovaai / bros
☆159Updated 2 years ago
doc-analysis / DocBank
DocBank: A Benchmark Dataset for Document Layout Analysis
☆622Updated 11 months ago
DS4SD / DocLayNet
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
☆360Updated 2 years ago
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆147Updated 2 months ago
NormXU / ERNIE-Layout-Pytorch
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
☆106Updated last year
furkanbiten / idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
☆102Updated 2 years ago
clovaai / spade
☆80Updated 2 years ago
Academic-Hammer / SciTSR
Table structure recognition dataset of the paper: Complicated Table Structure Recognition
☆373Updated 5 years ago
ibm-aur-nlp / PubTabNet
☆455Updated 3 weeks ago
phamquiluan / table-transformer
CVPR 2022: Table Structure Recognition
☆40Updated 3 years ago
andreagemelli / doc2graph
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
☆129Updated 2 years ago
Layout-Parser / layout-model-training
The scripts for training Detectron2-based Layout Models on popular layout analysis datasets
☆212Updated last year
abdoelsayed2016 / Table-Detection-Structure-Recognition
https://dl.acm.org/doi/10.1145/3657281
☆97Updated last year
doc-analysis / XFUND
XFUND: A Multilingual Form Understanding Benchmark
☆207Updated 3 years ago
entropy2333 / awesome-key-information-extraction
A curated list of papers about key information extraction.
☆97Updated 7 months ago
applicaai / lambert
Publicly released code for the LAMBERT model
☆103Updated 4 years ago
clovaai / cord
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
☆436Updated 3 years ago
herobd / dessurt
Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer
☆61Updated 2 years ago
microsoft / UDOP
☆247Updated 2 years ago
sachinraja13 / TabStructNet
☆130Updated 2 years ago
doc-analysis / ReadingBank
ReadingBank: A Benchmark Dataset for Reading Order Detection
☆107Updated 11 months ago
phamquiluan / PubLayNet
ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...
☆182Updated 4 years ago
Psarpei / Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition
☆280Updated 2 years ago
hpanwar08 / detectron2
Detectron2 for Document Layout Analysis
☆188Updated last year
cv-small-snails / Awesome-Table-Recognition
A curated list of resources dedicated to table recognition
☆403Updated 7 months ago
allanj / LayoutLMv3-DocVQA
Example codebase for fine-tuning layoutLMv3 on DocVQA
☆52Updated 2 years ago
Sanster / xy-cut
☆87Updated 3 years ago