huridocs/pdf-document-layout-analysis

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huridocs/pdf-document-layout-analysis)

huridocs / pdf-document-layout-analysis

A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of different parts of PDF pages, identifying the elements such as texts, titles, pictures, tables and so on.

☆1,273

Alternatives and similar repositories for pdf-document-layout-analysis

Users that are interested in pdf-document-layout-analysis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

huridocs / pdf-table-of-contents-extractor
View on GitHub
This project aims to extract Table of Contents (TOC) information from PDF files using the outputs generated by the pdf-document-layout-an…
☆21Feb 3, 2025Updated last year
opendatalab / DocLayout-YOLO
View on GitHub
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,235Apr 14, 2025Updated last year
ispras / dedoc
View on GitHub
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical …
☆715Jul 18, 2026Updated last week
chatclimate-ai / ParseStudio
View on GitHub
python package to parse pdfs with different parsers
☆269Sep 12, 2025Updated 10 months ago
alibaba / Logics-Parsing
View on GitHub
☆1,394May 13, 2026Updated 2 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
DS4SD / DocLayNet
View on GitHub
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
☆450Feb 1, 2023Updated 3 years ago
allenai / olmocr
View on GitHub
Toolkit for linearizing PDFs for LLM datasets/training
☆19,182Mar 25, 2026Updated 4 months ago
AlibabaResearch / AdvancedLiterateMachinery
View on GitHub
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…
☆1,833Mar 17, 2026Updated 4 months ago
funstory-ai / BabelDOC
View on GitHub
Yet Another Document Translator
☆8,996Jul 16, 2026Updated last week
NanoNets / docext
View on GitHub
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
☆2,032Mar 17, 2026Updated 4 months ago
datalab-to / marker
View on GitHub
Convert PDF to markdown + JSON quickly with high accuracy
☆37,843Updated this week
MarkPDFdown / markpdfdown
View on GitHub
A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具
☆1,930Jan 25, 2026Updated 6 months ago
bytedance / Dolphin
View on GitHub
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆9,039Mar 25, 2026Updated 4 months ago
opendatalab / PDF-Extract-Kit
View on GitHub
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,806Jan 3, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
datalab-to / surya
View on GitHub
OCR, layout analysis, reading order, table recognition in 90+ languages
☆21,149Updated this week
oomol-lab / pdf-craft
View on GitHub
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
☆6,024Jun 27, 2026Updated 3 weeks ago
FreeOCR-AI / layoutreader
View on GitHub
A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.
☆322Aug 15, 2025Updated 11 months ago
chatdoc-com / OCRFlux
View on GitHub
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay…
☆2,523Apr 14, 2026Updated 3 months ago
Ucas-HaoranWei / GOT-OCR2.0
View on GitHub
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,158Feb 10, 2025Updated last year
Yuliang-Liu / MonkeyOCR
View on GitHub
A lightweight LMM-based Document Parsing Model
☆6,607Updated this week
huridocs / pdf-reading-order
View on GitHub
☆16Apr 26, 2024Updated 2 years ago
studio-dots-ai / dots.ocr
View on GitHub
Multilingual Document Layout Parsing in a Single Vision-Language Model
☆9,028Mar 24, 2026Updated 4 months ago
opendatalab / MinerU
View on GitHub
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆75,694Updated this week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
opendatalab / OmniDocBench
View on GitHub
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
☆1,914Updated this week
datalab-to / chandra
View on GitHub
OCR model that handles complex tables, forms, handwriting with full layout.
☆11,774Jun 26, 2026Updated last month
docling-project / docling
View on GitHub
Get your documents ready for gen AI
☆63,762Updated this week
shcherbak-ai / contextgem
View on GitHub
ContextGem: Effortless LLM extraction from documents
☆1,863Jun 6, 2026Updated last month
getomni-ai / zerox
View on GitHub
OCR & Document Extraction using vision models
☆12,258May 20, 2025Updated last year
kyryl-opens-ml / no-ocr
View on GitHub
https://no-ocr.com/about
☆182Jun 30, 2025Updated last year
deepdoctection / deepdoctection
View on GitHub
A Repo For Document AI
☆3,198Jun 20, 2026Updated last month
poloclub / unitable
View on GitHub
UniTable: Towards a Unified Table Foundation Model
☆534Apr 21, 2026Updated 3 months ago
yujunhuics / LayoutReader
View on GitHub
阅读顺序、Layoutreader
☆18May 8, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Layout-Parser / layout-parser
View on GitHub
A Unified Toolkit for Deep Learning Based Document Image Analysis
☆5,765Aug 15, 2024Updated last year
pymupdf / PyMuPDF
View on GitHub
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
☆10,315Updated this week
opendataloader-project / opendataloader-pdf
View on GitHub
PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
☆27,841Updated this week
lumina-ai-inc / chunkr
View on GitHub
Vision infrastructure to turn complex documents into RAG/LLM-ready data
☆4,045Apr 9, 2026Updated 3 months ago
BobLd / DocumentLayoutAnalysis
View on GitHub
Document Layout Analysis resources repos for development with PdfPig.
☆637Oct 1, 2023Updated 2 years ago
yobix-ai / extractous
View on GitHub
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
☆1,768Dec 21, 2024Updated last year
PaddlePaddle / PaddleOCR
View on GitHub
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…
☆86,238Updated this week