The official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆81 · Oct 14, 2023 · Updated 2 years ago
Alternatives and similar repositories for Layout2Graph
Users that are interested in Layout2Graph are comparing it to the libraries listed below.
- ☆18 · May 30, 2023 · Updated 2 years ago
- An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP. ☆107 · Nov 15, 2023 · Updated 2 years ago
- Convert PubLayNet data into METS/PAGE-XML ☆10 · Mar 17, 2020 · Updated 6 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. ☆206 · Mar 1, 2025 · Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task… ☆287 · Feb 13, 2023 · Updated 3 years ago
- Unofficial code for augment-XY-CUT in XYLayoutLM ☆30 · Jul 12, 2022 · Updated 3 years ago
- OCR-D post-correction module based on weighted finite-state transducers ☆11 · Jan 13, 2024 · Updated 2 years ago
- OCR-D wrapper for detectron2 based segmentation models ☆17 · May 1, 2025 · Updated 11 months ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral) ☆42 · Oct 6, 2023 · Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection ☆117 · Aug 26, 2024 · Updated last year
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆641 · Aug 12, 2024 · Updated last year
- ☆14 · Aug 31, 2023 · Updated 2 years ago
- ☆1,047 · Jul 9, 2025 · Updated 9 months ago
- PAGE XML format collection for document image page content and more ☆71 · Jan 16, 2026 · Updated 2 months ago
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications. ☆41 · Apr 7, 2025 · Updated last year
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks. ☆137 · Oct 18, 2025 · Updated 5 months ago
- An unofficial implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents ☆37 · Sep 9, 2023 · Updated 2 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19… ☆16 · Oct 18, 2024 · Updated last year
- Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition (ICDAR 2023) ☆14 · Aug 29, 2023 · Updated 2 years ago
- OCR-D compliant toolset for optical layout recognition on historical German-language documents published in Brazil ☆11 · Sep 24, 2021 · Updated 4 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 · Dec 17, 2021 · Updated 4 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆180 · Mar 18, 2023 · Updated 3 years ago
- Whisper in TensorRT-LLM ☆17 · Sep 21, 2023 · Updated 2 years ago
- A large-scale camera-taken table detection and recognition dataset. ☆149 · Jul 21, 2025 · Updated 8 months ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2) ☆15 · Jan 20, 2026 · Updated 2 months ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆218 · Sep 26, 2023 · Updated 2 years ago
- Chinese document classification with LayoutLMv3 and LayoutXLM ☆46 · Oct 25, 2022 · Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan… ☆363 · Oct 31, 2022 · Updated 3 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in the EMNLP 2023 paper Reading Order Matters: Informa… ☆17 · Mar 20, 2024 · Updated 2 years ago
- The HierText dataset contains ~12k images from the Open Images dataset v6 with a large amount of text entities. We provide word, line and p… ☆307 · Dec 2, 2024 · Updated last year