A Docker-powered service for PDF document layout analysis. It segments PDF pages and classifies their parts, identifying elements such as text, titles, pictures, and tables.
☆1,089 · Jan 9, 2026 · Updated last month
Alternatives and similar repositories for pdf-document-layout-analysis
Users interested in pdf-document-layout-analysis are comparing it to the libraries listed below
- This project aims to extract Table of Contents (TOC) information from PDF files using the outputs generated by the pdf-document-layout-an… ☆20 · Feb 3, 2025 · Updated last year
- ☆15 · Apr 26, 2024 · Updated last year
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception ☆2,017 · Apr 14, 2025 · Updated 10 months ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆414 · Feb 1, 2023 · Updated 3 years ago
- https://no-ocr.com/about ☆178 · Jun 30, 2025 · Updated 8 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team… ☆1,820 · Apr 9, 2025 · Updated 10 months ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction ☆9,402 · Jan 3, 2025 · Updated last year
- Toolkit for linearizing PDFs for LLM datasets/training ☆16,947 · Feb 19, 2026 · Updated last week
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆312 · Aug 15, 2025 · Updated 6 months ago
- Reading order, Layoutreader ☆19 · May 8, 2025 · Updated 9 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆8,089 · Feb 10, 2025 · Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,360 · Feb 24, 2026 · Updated last week
- python package to parse pdfs with different parsers ☆248 · Sep 12, 2025 · Updated 5 months ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows. ☆55,275 · Updated this week
- OCR & Document Extraction using vision models ☆12,155 · May 20, 2025 · Updated 9 months ago
- Yet Another Document Translator ☆7,810 · Feb 15, 2026 · Updated 2 weeks ago
- Dedoc is a library (service) for automating document parsing and bringing documents to a uniform format. It automatically extracts content, logical … ☆647 · Updated this week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/) ☆1,862 · Aug 25, 2025 · Updated 6 months ago
- PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books. ☆4,895 · Feb 12, 2026 · Updated 2 weeks ago
- A high-quality PDF to Markdown tool based on large language model visual recognition. ☆1,665 · Jan 25, 2026 · Updated last month
- Convert Everything to PDF ☆220 · Feb 1, 2026 · Updated last month
- yet another m3u8 player ☆13 · Jun 8, 2025 · Updated 8 months ago
- A Unified Toolkit for Deep Learning Based Document Image Analysis ☆5,660 · Aug 15, 2024 · Updated last year
- Convert PDF to markdown + JSON quickly with high accuracy ☆32,069 · Updated this week
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams) ☆682 · May 20, 2025 · Updated 9 months ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what… ☆328 · Feb 9, 2025 · Updated last year
- UniTable: Towards a Unified Table Foundation Model ☆525 · Jun 4, 2024 · Updated last year
- Completely free, private, UI based Tech Documentation MCP server. Designed with coders and software developers in mind. Easily integrate i… ☆2,036 · Feb 4, 2026 · Updated last month
- A Repo For Document AI ☆3,139 · Updated this week
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis ☆151 · Feb 4, 2026 · Updated last month
- ☆883 · Feb 13, 2026 · Updated 2 weeks ago
- An agentic company research tool powered by LangGraph and Tavily that conducts deep diligence on companies using a multi-agent framework.… ☆1,602 · Updated this week
- An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC ☆76 · Updated this week
- OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay… ☆2,484 · Aug 4, 2025 · Updated 7 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,940 · Sep 24, 2025 · Updated 5 months ago
- Document Layout Analysis resources repos for development with PdfPig. ☆631 · Oct 1, 2023 · Updated 2 years ago
- Get beautiful, world-class documentation for any repo ☆426 · Apr 3, 2025 · Updated 11 months ago
- Get your documents ready for gen AI ☆54,094 · Feb 24, 2026 · Updated last week
- [EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with layout fully preserved, supporting services such as Google/DeepL/Ollama/OpenAI,… ☆31,890 · Nov 25, 2025 · Updated 3 months ago