Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
☆362Oct 31, 2022Updated 3 years ago
Alternatives and similar repositories for LiLT
Users that are interested in LiLT are comparing it to the libraries listed below
Sorting:
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆288Feb 13, 2023Updated 3 years ago
- ☆161Dec 27, 2022Updated 3 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆203Mar 1, 2025Updated last year
- Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…☆569Jul 25, 2024Updated last year
- Publicly released code for the LAMBERT model☆105Jun 14, 2021Updated 4 years ago
- A curated list of resources for Document Understanding (DU) topic☆1,501Jun 2, 2023Updated 2 years ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆20Dec 4, 2024Updated last year
- ☆81Jun 12, 2023Updated 2 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆142May 15, 2024Updated last year
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆53Jan 9, 2024Updated 2 years ago
- ☆249Jan 22, 2023Updated 3 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Aug 8, 2023Updated 2 years ago
- XFUND: A Multilingual Form Understanding Benchmark☆217Jul 15, 2022Updated 3 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis☆634Aug 12, 2024Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆107Nov 15, 2023Updated 2 years ago
- ☆108Feb 16, 2021Updated 5 years ago
- OCR toolbox from Davar-Lab☆759Nov 16, 2023Updated 2 years ago
- This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …☆182Sep 15, 2021Updated 4 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Jan 11, 2023Updated 3 years ago
- A curated list of papers about key information extraction.☆105Dec 18, 2024Updated last year
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆152Sep 17, 2025Updated 5 months ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Aug 20, 2022Updated 3 years ago
- 2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.☆468Jul 4, 2022Updated 3 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆117Aug 26, 2024Updated last year
- Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)☆144Jul 26, 2023Updated 2 years ago
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,792Jul 11, 2024Updated last year
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- 视觉预训练基础模型仓库☆501Apr 12, 2023Updated 2 years ago
- The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…☆306Dec 2, 2024Updated last year
- ☆18Jun 7, 2023Updated 2 years ago
- an unofficial code for augment-XY-CUT in XYLayoutLM☆30Jul 12, 2022Updated 3 years ago
- Table structure recognition dataset of the paper: Complicated Table Structure Recognition☆379Jul 7, 2020Updated 5 years ago
- EATEN: Entity-aware Attention for Single Shot Visual Text Extraction☆184Dec 29, 2019Updated 6 years ago
- ☆102Dec 23, 2024Updated last year
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆52Sep 19, 2022Updated 3 years ago
- A curated list of resources dedicated to table recognition☆406Dec 12, 2024Updated last year
- [Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"☆17Dec 1, 2023Updated 2 years ago
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021☆572Jun 14, 2024Updated last year