An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
☆82Oct 14, 2023Updated 2 years ago
Alternatives and similar repositories for Layout2Graph
Users that are interested in Layout2Graph are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19May 30, 2023Updated 3 years ago
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆107Nov 15, 2023Updated 2 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI.☆209Mar 1, 2025Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆288Feb 13, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- an unofficial code for augment-XY-CUT in XYLayoutLM☆30Jul 12, 2022Updated 3 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- OCR-D wrapper for detectron2 based segmentation models☆16May 1, 2025Updated last year
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆43Oct 6, 2023Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆117Aug 26, 2024Updated last year
- DocBank: A Benchmark Dataset for Document Layout Analysis☆647Aug 12, 2024Updated last year
- ☆14Aug 31, 2023Updated 2 years ago
- ☆1,048Jul 9, 2025Updated 11 months ago
- PAGE XML format collection for document image page content and more☆71Jan 16, 2026Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated last year
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆138Oct 18, 2025Updated 8 months ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Sep 9, 2023Updated 2 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition (ICDAR 2023)☆14Aug 29, 2023Updated 2 years ago
- OCR-D compliant toolset for optical layout recognition on historical german-language documents published in Brazil☆11Sep 24, 2021Updated 4 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆180Mar 18, 2023Updated 3 years ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A large scale camera-taken table detection and recognition dataset.☆150Apr 9, 2026Updated 2 months ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆17Jun 5, 2026Updated 3 weeks ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆220Sep 26, 2023Updated 2 years ago
- chinese document classification of layoutlmv3 and layoutxlm☆45Oct 25, 2022Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆366Oct 31, 2022Updated 3 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…☆313Dec 2, 2024Updated last year
- ☆163Dec 27, 2022Updated 3 years ago
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆64May 15, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆440Feb 1, 2023Updated 3 years ago
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)☆102Jun 28, 2024Updated 2 years ago
- Recognize text using Calamari OCR and the OCR-D framework☆16May 13, 2025Updated last year
- Repository to use/train segmentation models for document layout analysis☆19Jan 13, 2022Updated 4 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆154Sep 17, 2025Updated 9 months ago
- ☆38Oct 20, 2023Updated 2 years ago
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Aug 10, 2023Updated 2 years ago