DS4SD/DocLayNet

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

☆446

Alternatives and similar repositories for DocLayNet

Users who are interested in DocLayNet are comparing it to the repositories listed below.

doc-analysis / DocBank
DocBank: A Benchmark Dataset for Document Layout Analysis
☆652 · Aug 12, 2024 · Updated last year
DS4SD / deepsearch-toolkit
Interact with the Deep Search platform for new knowledge explorations and discoveries
☆228 · Jan 24, 2025 · Updated last year
HCIILAB / M6Doc
☆163 · May 8, 2025 · Updated last year
ibm-aur-nlp / PubLayNet
☆1,051 · Jul 9, 2025 · Updated last year
allenai / vila
Incorporating VIsual LAyout Structures for Scientific Text Classification
☆180 · Mar 18, 2023 · Updated 3 years ago
docling-project / docling-ibm-models
☆207 · Jun 4, 2026 · Updated last month
buptlihang / CDLA
CDLA: A Chinese document layout analysis (CDLA) dataset
☆293 · Sep 13, 2021 · Updated 4 years ago
DS4SD / deepsearch-glm
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
☆60 · Jan 27, 2025 · Updated last year
DS4SD / deepsearch-examples
Examples using the Deep Search functionalities
☆89 · Jan 29, 2025 · Updated last year
doc-analysis / ReadingBank
ReadingBank: A Benchmark Dataset for Reading Order Detection
☆117 · Aug 26, 2024 · Updated last year
BobLd / DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
☆637 · Oct 1, 2023 · Updated 2 years ago
DS4SD / quackling
Build document-native LLM applications
☆58 · Sep 11, 2024 · Updated last year
microsoft / CompHRDoc
Datasets and Evaluation Scripts for CompHRDoc
☆59 · Feb 25, 2025 · Updated last year
bertsky / ocrd_publaynet
convert PubLayNet data into METS/PAGE-XML
☆10 · Mar 17, 2020 · Updated 6 years ago
LivingSkyTechnologies / Document_Layout_Segmentation
Repository to use/train segmentation models for document layout analysis
☆19 · Jan 13, 2022 · Updated 4 years ago
doc-analysis / XFUND
XFUND: A Multilingual Form Understanding Benchmark
☆223 · Jul 15, 2022 · Updated 4 years ago
rossumai / docile
DocILE: Document Information Localization and Extraction Benchmark
☆149 · Jun 17, 2026 · Updated last month
AlibabaResearch / AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…
☆1,833 · Mar 17, 2026 · Updated 4 months ago
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆154 · Sep 17, 2025 · Updated 10 months ago
adlnlp / doc_gcn
☆19 · May 30, 2023 · Updated 3 years ago
LynnHaDo / Document-Layout-Analysis
Object Detection Model for Scanned Documents
☆95 · Mar 6, 2025 · Updated last year
bertsky / ocrd_detectron2
OCR-D wrapper for detectron2 based segmentation models
☆16 · May 1, 2025 · Updated last year
FreeOCR-AI / yolo-doclaynet
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
☆158 · Mar 10, 2026 · Updated 4 months ago
jpWang / LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆366 · Oct 31, 2022 · Updated 3 years ago
Layout-Parser / layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
☆5,763 · Aug 15, 2024 · Updated last year
opendatalab / DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,232 · Apr 14, 2025 · Updated last year
clovaai / donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
☆6,905 · Jul 11, 2024 · Updated 2 years ago
moured / YOLOv10-Document-Layout-Analysis
YOLOv10 trained on DocLayNet dataset.
☆82 · Nov 1, 2024 · Updated last year
hikopensource / DAVAR-Lab-OCR
OCR toolbox from Davar-Lab
☆762 · Jun 29, 2026 · Updated 3 weeks ago
ibm-aur-nlp / PubTabNet
☆483 · Jul 8, 2025 · Updated last year
LivingSkyTechnologies / Dense_Article_Dataset_DAD
Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis
☆16 · Jan 13, 2022 · Updated 4 years ago
biswassanket / synth_doc_generation
Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021
☆93 · Jul 16, 2021 · Updated 5 years ago
ayanban011 / SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆74 · Sep 12, 2024 · Updated last year
shabie / docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆290 · Feb 13, 2023 · Updated 3 years ago
Layout-Parser / layout-model-training
The scripts for training Detectron2-based Layout Models on popular layout analysis datasets
☆220 · Sep 26, 2023 · Updated 2 years ago
mindee / doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning. Ongo…
☆6,186 · Updated this week
microsoft / table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…
☆2,930 · Jun 24, 2024 · Updated 2 years ago
wanghaisheng / ocr-arxiv-daily
☆19 · Jun 7, 2023 · Updated 3 years ago
OCR-D / ocrd_anybaseocr
DFKI Layout Detection for OCR-D
☆47 · May 1, 2025 · Updated last year