CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包
☆784Feb 7, 2026Updated last month
Alternatives and similar repositories for CnSTD
Users that are interested in CnSTD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,737Feb 7, 2026Updated last month
- Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集☆34Dec 21, 2022Updated 3 years ago
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them…☆3,059Feb 7, 2026Updated last month
- 基于cnstd+cnocr作为基础,封装的一个ocr的web服务☆10Nov 21, 2021Updated 4 years ago
- 基于Pytorch的OCR工具库,支持常用的文字检测和识别算法☆1,515Jan 4, 2026Updated 2 months ago
- 文档方向分类☆222Feb 3, 2026Updated last month
- Free Offline OCR 离线的中文文本检测+识别SDK☆1,377Jan 12, 2026Updated 2 months ago
- CDLA: A Chinese document layout analysis (CDLA) dataset☆289Sep 13, 2021Updated 4 years ago
- TDF-ICDAR 2019 Dataset for Typeset Math Formula Detection☆69Feb 9, 2020Updated 6 years ago
- FormulaNet is a new large-scale Mathematical Formula Detection dataset.☆20Nov 21, 2022Updated 3 years ago
- yolo3+ocr☆6,120Aug 29, 2022Updated 3 years ago
- Scanning Single Shot Detector for Math in Document Images☆133Apr 18, 2023Updated 2 years ago
- 📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.☆6,153Updated this week
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,275Aug 14, 2023Updated 2 years ago
- Analysis of Chinese and English layouts 中英文版面分析☆269Mar 6, 2026Updated 2 weeks ago
- 1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection(公式检测冠军方案)☆133Sep 4, 2023Updated 2 years ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,825Mar 17, 2026Updated last week
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆382Nov 3, 2024Updated last year
- 收集并整理有关OCR的数据集并统一标注格式,以便实验需要☆966Nov 28, 2023Updated 2 years ago
- 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro, supports handwrite, Chinese-mixed formulas and simple symbol reaso…☆1,287Jun 11, 2024Updated last year
- mxnet-Gluon implementation of PSENet text detector (Shape Robust Text Detection with Progressive Scale Expansion Network)☆18Jun 17, 2019Updated 6 years ago
- This repo is used to release the ArxivFormula dataset.☆35Nov 12, 2024Updated last year
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆459Sep 28, 2025Updated 5 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆317Aug 15, 2025Updated 7 months ago
- The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.☆513Feb 1, 2026Updated last month
- 基于pytorch的ocr算法库,包括 psenet, pan, dbnet, sast , crnn☆679May 19, 2021Updated 4 years ago
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆72,686Updated this week
- 利用语言模型,纠正OCR识别错误☆475May 22, 2023Updated 2 years ago
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆415Sep 4, 2025Updated 6 months ago
- PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)☆1,153Sep 11, 2025Updated 6 months ago
- A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".☆2,244Mar 11, 2024Updated 2 years ago
- Generate text line images for training deep learning OCR models☆902Jan 17, 2026Updated 2 months ago
- OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commer…☆1,289Mar 2, 2026Updated 3 weeks ago
- end2end layout analysis based seq2seq☆132Mar 8, 2021Updated 5 years ago
- Learning Generative Structure Prior for Blind Text Image Super-resolution [CVPR 2023]☆261Aug 18, 2025Updated 7 months ago
- Document Rectification and Illumination Correction using a Patch-based CNN☆396Sep 28, 2022Updated 3 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis☆642Aug 12, 2024Updated last year
- TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,…☆726Aug 22, 2025Updated 7 months ago
- OCR toolbox from Davar-Lab☆762Nov 16, 2023Updated 2 years ago