CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包
☆781Feb 7, 2026Updated 3 weeks ago
Alternatives and similar repositories for CnSTD
Users that are interested in CnSTD are comparing it to the libraries listed below
Sorting:
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,730Feb 7, 2026Updated 3 weeks ago
- Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集☆34Dec 21, 2022Updated 3 years ago
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them…☆3,020Feb 7, 2026Updated 3 weeks ago
- 基于Pytorch的OCR工具库,支持常用的文字检测和识别算法☆1,513Jan 4, 2026Updated 2 months ago
- 基于cnstd+cnocr作为基础,封装的一个ocr的web服务☆11Nov 21, 2021Updated 4 years ago
- CDLA: A Chinese document layout analysis (CDLA) dataset☆288Sep 13, 2021Updated 4 years ago
- Free Offline OCR 离线的中文文本检测+识别SDK☆1,376Jan 12, 2026Updated last month
- 文档方向分类☆222Feb 3, 2026Updated last month
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆380Nov 3, 2024Updated last year
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, MNN, PaddlePaddle and PyTorch.☆6,021Updated this week
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,820Apr 9, 2025Updated 10 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆312Aug 15, 2025Updated 6 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆458Sep 28, 2025Updated 5 months ago
- yolo3+ocr☆6,116Aug 29, 2022Updated 3 years ago
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,274Aug 14, 2023Updated 2 years ago
- Scanning Single Shot Detector for Math in Document Images☆133Apr 18, 2023Updated 2 years ago
- Analysis of Chinese and English layouts 中英文版面分析☆267Feb 25, 2026Updated last week
- 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro, supports handwrite, Chinese-mixed formulas and simple symbol reaso…☆1,281Jun 11, 2024Updated last year
- 收集并整理有关OCR的数据集并统一标注格式,以便实验需要☆965Nov 28, 2023Updated 2 years ago
- TDF-ICDAR 2019 Dataset for Typeset Math Formula Detection☆69Feb 9, 2020Updated 6 years ago
- 1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection(公式检测冠军方案)☆133Sep 4, 2023Updated 2 years ago
- 基于pytorch的ocr算法库,包括 psenet, pan, dbnet, sast , crnn☆679May 19, 2021Updated 4 years ago
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆410Sep 4, 2025Updated 6 months ago
- Table Structure Recognition☆81Mar 11, 2023Updated 2 years ago
- This repo is used to release the ArxivFormula dataset.☆35Nov 12, 2024Updated last year
- FormulaNet is a new large-scale Mathematical Formula Detection dataset.☆20Nov 21, 2022Updated 3 years ago
- The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.☆505Feb 1, 2026Updated last month
- PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)☆1,140Sep 11, 2025Updated 5 months ago
- Generate text line images for training deep learning OCR models☆898Jan 17, 2026Updated last month
- 整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…☆924Aug 3, 2025Updated 7 months ago
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆71,369Updated this week
- mxnet-Gluon implementation of PSENet text detector (Shape Robust Text Detection with Progressive Scale Expansion Network)☆18Jun 17, 2019Updated 6 years ago
- (WIP)The deployment framework aims to provide a simple, lightweight, fast integrated, pipelined deployment framework for algorithm servic…☆135Jul 21, 2021Updated 4 years ago
- Document Rectification and Illumination Correction using a Patch-based CNN☆396Sep 28, 2022Updated 3 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis☆635Aug 12, 2024Updated last year
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,017Apr 14, 2025Updated 10 months ago
- A toolbox of ocr models and algorithms based on MindSpore☆299Jul 24, 2025Updated 7 months ago
- A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".☆2,246Mar 11, 2024Updated last year
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆306Sep 10, 2024Updated last year