ReadingBank: A Benchmark Dataset for Reading Order Detection
☆116Aug 26, 2024Updated last year
Alternatives and similar repositories for ReadingBank
Users that are interested in ReadingBank are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DocBank: A Benchmark Dataset for Document Layout Analysis☆644Aug 12, 2024Updated last year
- XFUND: A Multilingual Form Understanding Benchmark☆219Jul 15, 2022Updated 3 years ago
- Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading"☆11Oct 25, 2023Updated 2 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆320Aug 15, 2025Updated 8 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆81Oct 14, 2023Updated 2 years ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆287Feb 13, 2023Updated 3 years ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆431Feb 1, 2023Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆364Oct 31, 2022Updated 3 years ago
- ☆16Apr 26, 2024Updated 2 years ago
- Publicly released code for the LAMBERT model☆106Jun 14, 2021Updated 4 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- Document Layout Analysis resources repos for development with PdfPig.☆634Oct 1, 2023Updated 2 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆35Aug 20, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A curated list of resources for Document Understanding (DU) topic☆1,511Jun 2, 2023Updated 2 years ago
- EATEN: Entity-aware Attention for Single Shot Visual Text Extraction☆184Dec 29, 2019Updated 6 years ago
- TableBank: A Benchmark Dataset for Table Detection and Recognition☆1,080Aug 12, 2024Updated last year
- Table structure recognition dataset of the paper: Complicated Table Structure Recognition☆382Jul 7, 2020Updated 5 years ago
- ☆102Dec 23, 2024Updated last year
- ☆1,047Jul 9, 2025Updated 9 months ago
- ☆108Feb 16, 2021Updated 5 years ago
- CDLA: A Chinese document layout analysis (CDLA) dataset☆294Sep 13, 2021Updated 4 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆58Feb 25, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- OCR toolbox from Davar-Lab☆761Nov 16, 2023Updated 2 years ago
- ☆249Jan 22, 2023Updated 3 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI.☆207Mar 1, 2025Updated last year
- Evaluation Tool for the ICDAR 2019 Competition on Table Detection and Recognition☆42May 8, 2022Updated 3 years ago
- ☆483Jul 8, 2025Updated 9 months ago
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 3 years ago
- Document Visual Question Answering☆131Jul 30, 2020Updated 5 years ago
- ☆37Jan 26, 2026Updated 3 months ago
- Evaluation framework for document processing models and services.☆70Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆82Feb 8, 2023Updated 3 years ago
- ☆78Aug 7, 2023Updated 2 years ago
- ☆42Feb 6, 2021Updated 5 years ago
- ☆38Oct 20, 2023Updated 2 years ago
- A curated list of resources dedicated to table recognition☆404Dec 12, 2024Updated last year
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,829Mar 17, 2026Updated last month
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆24Mar 17, 2021Updated 5 years ago