An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆81Oct 14, 2023Updated 2 years ago
Alternatives and similar repositories for Layout2Graph
Users that are interested in Layout2Graph are comparing it to the libraries listed below
Sorting:
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆107Nov 15, 2023Updated 2 years ago
- ☆18May 30, 2023Updated 2 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆203Mar 1, 2025Updated 11 months ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆288Feb 13, 2023Updated 3 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 5 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆116Aug 26, 2024Updated last year
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Sep 9, 2023Updated 2 years ago
- an unofficial code for augment-XY-CUT in XYLayoutLM☆30Jul 12, 2022Updated 3 years ago
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆137Oct 18, 2025Updated 4 months ago
- ☆38Oct 20, 2023Updated 2 years ago
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)☆102Jun 28, 2024Updated last year
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated 10 months ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- A large scale camera-taken table detection and recognition dataset.☆149Jul 21, 2025Updated 7 months ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆75Sep 12, 2024Updated last year
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆42Oct 6, 2023Updated 2 years ago
- PAGE XML format collection for document image page content and more☆70Jan 16, 2026Updated last month
- ☆1,039Jul 9, 2025Updated 7 months ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆152Sep 17, 2025Updated 5 months ago
- ☆45Jul 18, 2022Updated 3 years ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition (ICDAR 2023)☆15Aug 29, 2023Updated 2 years ago
- OCR-D wrapper for detectron2 based segmentation models☆17May 1, 2025Updated 9 months ago
- ☆161Dec 27, 2022Updated 3 years ago
- Table structure recognition dataset of the paper: Complicated Table Structure Recognition☆379Jul 7, 2020Updated 5 years ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Dec 9, 2022Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆361Oct 31, 2022Updated 3 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Aug 8, 2023Updated 2 years ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆32Jun 12, 2025Updated 8 months ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Mar 18, 2023Updated 2 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆53Jan 9, 2024Updated 2 years ago
- Distorted Document Images dataset (DDI-100).☆146Nov 1, 2022Updated 3 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Apr 3, 2024Updated last year
- ☆27Nov 29, 2023Updated 2 years ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated 11 months ago
- The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…☆306Dec 2, 2024Updated last year
- ☆97Jul 12, 2022Updated 3 years ago
- ICDAR 2024 Table OCR Model☆39Feb 20, 2026Updated last week
- The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…☆282May 30, 2025Updated 9 months ago