huridocs / pdf-document-layout-analysis
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of different parts of PDF pages, identifying the elements such as texts, titles, pictures, tables and so on.
☆480Updated 3 weeks ago
Alternatives and similar repositories for pdf-document-layout-analysis:
Users that are interested in pdf-document-layout-analysis are comparing it to the libraries listed below
- Lightweight, performant, deep table extraction☆453Updated 3 weeks ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆368Updated 2 weeks ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆231Updated 4 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆210Updated 11 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆1,133Updated last week
- Parse PDFs into markdown using Vision LLMs☆345Updated 2 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆310Updated last month
- A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处…☆256Updated 2 months ago
- SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆123Updated 3 weeks ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆143Updated 7 months ago
- TF-ID: Table/Figure IDentifier for academic papers☆232Updated 9 months ago
- UniTable: Towards a Unified Table Foundation Model☆461Updated 10 months ago
- Detect and extract tables to markdown and csv☆742Updated 3 months ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆191Updated 5 months ago
- ☆440Updated last month
- OpenAI DeepResearch alternative, An AI-driven research system that performs comprehensive, iterative research on any topic using multiple…☆561Updated last month
- LettuceDetect is a hallucination detection framework for RAG applications.☆385Updated 2 weeks ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆336Updated 2 years ago
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆274Updated 2 weeks ago
- ☆1,464Updated last month
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆452Updated last month
- Analysis of Chinese and English layouts 中英文版面分析☆201Updated 3 weeks ago
- Knowledge Graph Generation from Any Text☆427Updated last month
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆156Updated this week
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆597Updated 2 weeks ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆91Updated 5 months ago
- Simple package to extract text with coordinates from programmatic PDFs☆109Updated 2 weeks ago
- Prompt optimization scratch☆699Updated last week
- 🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pr…☆428Updated this week
- ☆105Updated last week