RapidAI/RapidLayout

RapidAI / RapidLayout

Layout analysis of Chinese and English documents

☆275

Alternatives and similar repositories for RapidLayout

Users who are interested in RapidLayout are comparing it to the libraries listed below.

RapidAI / RapidDocEx
📝 Extracts content from document images and reproduces them one-to-one in Word or TXT for further use or processing. Planned: PDF/image input, with output in JSON, TXT, Word, and Markdown formats.
☆208 · Nov 1, 2024 · Updated last year
RapidAI / RapidTable
An inference library for sequence-based table recognition algorithms, integrating table recognition algorithms such as PP-Structure and modelscope.
☆432 · Apr 23, 2026 · Updated 2 months ago
RapidAI / TableStructureRec
Collects the best table recognition models currently available as open source, improves pre- and post-processing, and converts the models to ONNX.
☆954 · Aug 3, 2025 · Updated 11 months ago
Sanster / OhMyTable
Table Structure Recognition
☆28 · Jul 25, 2024 · Updated last year
RapidAI / RapidUnDistort
Corrects document distortion, blur, shadows, and similar defects using ONNX models for simple, lightweight deployment. The project will keep tracking the latest and best document rectification methods and models.
☆105 · Dec 17, 2025 · Updated 7 months ago
360AILAB-NLP / 360LayoutAnalysis
360LayoutAnalysis, a series of document analysis models and datasets developed by 360 AI Research Institute
☆305 · Sep 10, 2024 · Updated last year
opendatalab / DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,232 · Apr 14, 2025 · Updated last year
RapidAI / RapidTableDetection
Detects and extracts table regions from images in various scenarios, and corrects perspective and rotation problems.
☆119 · Dec 10, 2024 · Updated last year
RapidAI / RapidOrientation
Document orientation classification
☆221 · Feb 3, 2026 · Updated 5 months ago
FreeOCR-AI / layoutreader
A faster LayoutReader model based on LayoutLMv3 that sorts OCR bounding boxes into reading order.
☆322 · Aug 15, 2025 · Updated 11 months ago
buptlihang / CDLA
CDLA: A Chinese document layout analysis (CDLA) dataset
☆293 · Sep 13, 2021 · Updated 4 years ago
hiroi-sora / GapTree_Sort_Algorithm
[Gap-Tree Sort Algorithm] Performs layout analysis on OCR results or text extracted from PDFs and sorts it into human reading order.
☆167 · Feb 28, 2024 · Updated 2 years ago
RapidAI / RapidOCR
📄 Awesome OCR toolkits for multiple programming languages, based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
☆7,214 · Jul 9, 2026 · Updated last week
jingsongliujing / OnnxOCR
A lightweight OCR system refactored from PaddleOCR and decoupled from the PaddlePaddle deep learning training framework, with very fast inference.
☆1,836 · Jun 11, 2026 · Updated last month
RapidAI / RapidOCRPDF
Extracts PDF content, based on RapidOCR
☆191 · Mar 6, 2026 · Updated 4 months ago
opendatalab / PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,796 · Jan 3, 2025 · Updated last year
SWHL / LGPMA_Infer
LGPMA inference for table structure recognition
☆25 · Nov 17, 2022 · Updated 3 years ago
SWHL / TrOCR-Formula-Rec
Trains a small, elegant formula recognition model based on TrOCR and the UniMER-1M dataset
☆30 · Mar 17, 2026 · Updated 4 months ago
Gmgge / TrOCR-Seal-Recognition
Transformer-based OCR, extended to official seal recognition
☆297 · Oct 24, 2025 · Updated 8 months ago
SWHL / ChineseDocumentPDF
PDF data from Chinese papers, securities documents, and financial reports
☆41 · Jun 13, 2024 · Updated 2 years ago
RapidAI / RapidDoc
A high-performance, open-source, one-stop PDF data extraction tool that converts complex PDF documents to Markdown and JSON using ONNX models.
☆202 · Jul 12, 2026 · Updated last week
breezedeus / CnSTD
CnSTD: a Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection (STD), Mathematical Formula Detection (MFD), and Layout Analysis
☆792 · Jul 5, 2026 · Updated 2 weeks ago
BADBADBADBOY / CardDetectRotate
Detection and rectification of ID cards and documents
☆87 · Sep 18, 2024 · Updated last year
360AILABNLP / 360LayoutAnalysis
☆28 · Oct 14, 2024 · Updated last year
CosmosShadow / gptpdf
Using GPT to parse PDF
☆3,558 · Apr 17, 2025 · Updated last year
RapidAI / PaddleOCRModelConvert
Convert the model in PaddleOCR to ONNX format
☆120 · Jul 15, 2025 · Updated last year
AlibabaResearch / AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…
☆1,833 · Mar 17, 2026 · Updated 4 months ago
frotms / PaddleOCR2Pytorch
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
☆1,197 · Jul 10, 2026 · Updated last week
tianchiguaixia / layoutlmv3-chinese
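This project trains and runs inference with layoutlmv3 on Chinese images. It mainly addresses three problems: 1. normalizing data into a usable training dataset format; 2. modifying tokenization for layoutlmv3-base-chinese; 3. splitting text longer than 512 and applying a sliding window.
☆63 · Sep 6, 2024 · Updated last year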
Ucas-HaoranWei / GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,154 · Feb 10, 2025 · Updated last year
yujunhuics / LayoutReader
Reading order, Layoutreader
☆18 · May 8, 2025 · Updated last year
Royalvice / DocDiff
ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along w…
☆350 · Aug 22, 2024 · Updated last year
Topdu / OpenOCR
OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commer…
☆1,415 · May 20, 2026 · Updated 2 months ago
poloclub / unitable
UniTable: Towards a Unified Table Foundation Model
☆533 · Apr 21, 2026 · Updated 2 months ago
fanqie03 / char-detection
🔥 Character (single-character) detection based on CRNN
☆90 · May 16, 2023 · Updated 3 years ago
InternScience / StructEqTable-Deploy
A High-efficiency Open-source Toolkit for Table-to-Latex Task
☆276 · Dec 6, 2025 · Updated 7 months ago
ZZZHANG-jx / DocRes
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
☆626 · Aug 3, 2025 · Updated 11 months ago
dirac472 / tableOCR
Table recognition in images plus OCR
☆26 · Mar 8, 2024 · Updated 2 years ago
Prakhar-97 / Table-detection-and-Document-layout-analysis
☆10 · Jun 22, 2020 · Updated 6 years ago