Official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆81 Oct 14, 2023 Updated 2 years ago
Alternatives and similar repositories for Layout2Graph
Users that are interested in Layout2Graph are comparing it to the libraries listed below
- ☆18 May 30, 2023 Updated 2 years ago
- An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP. ☆107 Nov 15, 2023 Updated 2 years ago
- Convert PubLayNet data into METS/PAGE-XML ☆10 Mar 17, 2020 Updated 6 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. ☆205 Mar 1, 2025 Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task… ☆288 Feb 13, 2023 Updated 3 years ago
- Unofficial code for augment-XY-CUT in XYLayoutLM ☆30 Jul 12, 2022 Updated 3 years ago
- OCR-D post-correction module based on weighted finite-state transducers ☆11 Jan 13, 2024 Updated 2 years ago
- OCR-D wrapper for detectron2 based segmentation models ☆17 May 1, 2025 Updated 10 months ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral) ☆42 Oct 6, 2023 Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection ☆117 Aug 26, 2024 Updated last year
- ☆14 Aug 31, 2023 Updated 2 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆639 Aug 12, 2024 Updated last year
- ☆1,042 Jul 9, 2025 Updated 8 months ago
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications. ☆41 Apr 7, 2025 Updated 11 months ago
- PAGE XML format collection for document image page content and more ☆70 Jan 16, 2026 Updated 2 months ago
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks. ☆137 Oct 18, 2025 Updated 5 months ago
- An unofficial implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents ☆37 Sep 9, 2023 Updated 2 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19… ☆16 Oct 18, 2024 Updated last year
- Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition (ICDAR 2023) ☆15 Aug 29, 2023 Updated 2 years ago
- OCR-D compliant toolset for optical layout recognition on historical German-language documents published in Brazil ☆11 Sep 24, 2021 Updated 4 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 Dec 17, 2021 Updated 4 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆179 Mar 18, 2023 Updated 3 years ago
- Whisper in TensorRT-LLM ☆17 Sep 21, 2023 Updated 2 years ago
- A large scale camera-taken table detection and recognition dataset. ☆149 Jul 21, 2025 Updated 7 months ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa… ☆17 Mar 20, 2024 Updated 2 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2) ☆15 Jan 20, 2026 Updated 2 months ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆218 Sep 26, 2023 Updated 2 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan… ☆362 Oct 31, 2022 Updated 3 years ago
- Chinese document classification with LayoutLMv3 and LayoutXLM ☆46 Oct 25, 2022 Updated 3 years ago
- The HierText dataset contains ~12k images from the Open Images dataset v6 with a large amount of text entities. We provide word, line and p… ☆306 Dec 2, 2024 Updated last year
- ☆161 Dec 27, 2022 Updated 3 years ago
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua… ☆64 May 15, 2025 Updated 10 months ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆423 Feb 1, 2023 Updated 3 years ago
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22) ☆102 Jun 28, 2024 Updated last year
- Recognize text using Calamari OCR and the OCR-D framework ☆15 May 13, 2025 Updated 10 months ago
- This is the code repo for the paper "UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction" ☆15 Aug 10, 2023 Updated 2 years ago
- Repository to use/train segmentation models for document layout analysis ☆19 Jan 13, 2022 Updated 4 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files ☆152 Sep 17, 2025 Updated 6 months ago
- ☆38 Oct 20, 2023 Updated 2 years ago