CnSTD: A Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis
☆792 · May 1, 2026 · Updated last month
Alternatives and similar repositories for CnSTD
Users who are interested in CnSTD are comparing it to the libraries listed below.
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen… ☆3,753 · Feb 7, 2026 · Updated 4 months ago
- Chinese Mathematical Formula Detection (MFD) Dataset: a mathematical formula detection dataset for Chinese documents ☆34 · Dec 21, 2022 · Updated 3 years ago
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them… ☆3,148 · Feb 7, 2026 · Updated 4 months ago
- An OCR web service built on top of cnstd + cnocr ☆10 · Nov 21, 2021 · Updated 4 years ago
- A PyTorch-based OCR toolkit supporting common text detection and recognition algorithms ☆1,522 · Jan 4, 2026 · Updated 5 months ago
- Document orientation classification ☆221 · Feb 3, 2026 · Updated 4 months ago
- Free Offline OCR: an offline Chinese text detection + recognition SDK ☆1,376 · Jan 12, 2026 · Updated 5 months ago
- CDLA: A Chinese document layout analysis (CDLA) dataset ☆295 · Sep 13, 2021 · Updated 4 years ago
- TDF-ICDAR 2019 Dataset for Typeset Math Formula Detection ☆69 · Feb 9, 2020 · Updated 6 years ago
- FormulaNet is a new large-scale Mathematical Formula Detection dataset. ☆21 · Nov 21, 2022 · Updated 3 years ago
- yolo3+ocr ☆6,113 · Aug 29, 2022 · Updated 3 years ago
- 📄 Awesome OCR multiple programming languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch. ☆6,770 · May 22, 2026 · Updated 3 weeks ago
- Scanning Single Shot Detector for Math in Document Images ☆133 · Apr 18, 2023 · Updated 3 years ago
- Ultra-lightweight Chinese OCR with vertical text recognition; supports ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); the total model size is only 4.7M ☆12,315 · May 18, 2026 · Updated 3 weeks ago
- Analysis of Chinese and English layouts ☆273 · Mar 24, 2026 · Updated 2 months ago
- 1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection (champion solution for formula detection) ☆134 · Sep 4, 2023 · Updated 2 years ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team… ☆1,829 · Mar 17, 2026 · Updated 2 months ago
- Formula recognition based on LaTeX-OCR and ONNXRuntime. ☆387 · Nov 3, 2024 · Updated last year
- Collects and organizes OCR-related datasets and unifies their annotation formats for experimental use ☆971 · Nov 28, 2023 · Updated 2 years ago
- Enhanced math formula recognition: handwritten and printed formulas in Chinese and English, with support for elementary symbolic derivation (data structures based on the LaTeX abstract syntax tree). Math Formula OCR Pro, supports handwrite, Chinese-mixed formulas and simple symbol reaso… ☆1,299 · Jun 11, 2024 · Updated 2 years ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition ☆483 · Sep 28, 2025 · Updated 8 months ago
- mxnet-Gluon implementation of PSENet text detector (Shape Robust Text Detection with Progressive Scale Expansion Network) ☆18 · Jun 17, 2019 · Updated 6 years ago
- This repo is used to release the ArxivFormula dataset. ☆35 · Nov 12, 2024 · Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆320 · Aug 15, 2025 · Updated 9 months ago
- The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023. ☆524 · Feb 1, 2026 · Updated 4 months ago
- A PyTorch-based OCR algorithm library, including psenet, pan, dbnet, sast, crnn ☆680 · May 19, 2021 · Updated 5 years ago
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/… ☆82,075 · Updated this week
- Uses a language model to correct OCR recognition errors ☆473 · May 22, 2023 · Updated 3 years ago
- An inference library for sequence-based table recognition algorithms, integrating table recognition algorithms such as PP-Structure and modelscope. ☆424 · Apr 23, 2026 · Updated last month
- PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR) ☆1,182 · Sep 11, 2025 · Updated 9 months ago
- A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization". ☆2,253 · Mar 11, 2024 · Updated 2 years ago
- Generate text line images for training deep learning OCR models ☆912 · May 17, 2026 · Updated 3 weeks ago
- end2end layout analysis based on seq2seq ☆132 · Mar 8, 2021 · Updated 5 years ago
- Learning Generative Structure Prior for Blind Text Image Super-resolution [CVPR 2023] ☆262 · Aug 18, 2025 · Updated 9 months ago
- OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commer… ☆1,363 · May 20, 2026 · Updated 3 weeks ago
- Document Rectification and Illumination Correction using a Patch-based CNN ☆396 · Sep 28, 2022 · Updated 3 years ago
- TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,… ☆742 · Aug 22, 2025 · Updated 9 months ago
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆646 · Aug 12, 2024 · Updated last year
- Collects the best currently open-source table recognition models, refines pre-processing and post-processing, and converts the models to ONNX ☆953 · Aug 3, 2025 · Updated 10 months ago