huridocs / pdf-document-layout-analysis
A Docker-powered service for PDF document layout analysis. It segments and classifies the different parts of PDF pages, identifying elements such as text, titles, pictures, and tables.
☆627Updated last month
Alternatives and similar repositories for pdf-document-layout-analysis
Users that are interested in pdf-document-layout-analysis are comparing it to the libraries listed below
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆602Updated 2 weeks ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆1,451Updated 3 months ago
- Lightweight, performant, deep table extraction☆487Updated this week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,511Updated 2 weeks ago
- Parse PDFs into markdown using Vision LLMs☆395Updated 5 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆253Updated 7 months ago
- Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.☆877Updated 9 months ago
- Detect and extract tables to markdown and csv☆749Updated 5 months ago
- ☆529Updated 11 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆257Updated last month
- SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆148Updated 3 months ago
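- 📝 Extracts content from document images and reproduces each image one-to-one in Word or Txt for further use or processing. Planned future support: PDF/image input, with output in the corresponding JSON, Txt, Word, and Markdown formats.☆197Updated 8 months ago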
- ☆477Updated 4 months ago
- python package to parse pdfs with different parsers☆197Updated 7 months ago
- Dedoc is a library (service) for automating document parsing and bringing documents to a uniform format. It automatically extracts content, logical …☆516Updated this week
- OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay…☆1,767Updated last week
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆144Updated 10 months ago
- A Python wrapper for the Doc2X API that comes with native text processing (to improve PDF recall in RAG).☆276Updated last month
- Analysis of Chinese and English layouts☆226Updated this week
- RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF☆980Updated 2 weeks ago
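- An inference library for sequence-based table recognition algorithms, integrating table recognition algorithms such as PP-Structure and modelscope.☆332Updated this week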
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆379Updated last month
- TF-ID: Table/Figure IDentifier for academic papers☆238Updated last year
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,746Updated 3 months ago
- UniTable: Towards a Unified Table Foundation Model☆485Updated last year
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,694Updated 4 months ago
- E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with ded…☆1,093Updated 10 months ago
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆493Updated last month
- To try TextIn document parsing, visit https://cc.co/16YSIy☆149Updated last month
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆657Updated last month