CnSTD: a Python 3 package, based on PyTorch/MXNet, for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis
☆790 · Apr 28, 2026 · Updated this week
Alternatives and similar repositories for CnSTD
Users who are interested in CnSTD are comparing it to the libraries listed below.
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen… ☆3,755 · Feb 7, 2026 · Updated 2 months ago
- Chinese Mathematical Formula Detection (MFD) Dataset: a dataset for detecting mathematical formulas in Chinese documents ☆34 · Dec 21, 2022 · Updated 3 years ago
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them… ☆3,111 · Feb 7, 2026 · Updated 2 months ago
- An OCR web service built on top of cnstd and cnocr ☆10 · Nov 21, 2021 · Updated 4 years ago
- A PyTorch-based OCR toolkit that supports common text detection and recognition algorithms ☆1,517 · Jan 4, 2026 · Updated 4 months ago
- Document orientation classification ☆221 · Feb 3, 2026 · Updated 3 months ago
- Free Offline OCR: an offline SDK for Chinese text detection and recognition ☆1,376 · Jan 12, 2026 · Updated 3 months ago
- CDLA: A Chinese document layout analysis (CDLA) dataset ☆295 · Sep 13, 2021 · Updated 4 years ago
- TDF-ICDAR 2019 Dataset for Typeset Math Formula Detection ☆69 · Feb 9, 2020 · Updated 6 years ago
- FormulaNet is a new large-scale Mathematical Formula Detection dataset. ☆21 · Nov 21, 2022 · Updated 3 years ago
- 📄 Awesome OCR toolkits for multiple programming languages, based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch. ☆6,468 · Apr 27, 2026 · Updated last week
- yolo3+ocr ☆6,120 · Aug 29, 2022 · Updated 3 years ago
- Scanning Single Shot Detector for Math in Document Images ☆133 · Apr 18, 2023 · Updated 3 years ago
- Ultra-lightweight Chinese OCR with support for vertical text recognition and for ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); total model size is only 4.7M ☆12,280 · Aug 14, 2023 · Updated 2 years ago
- Chinese and English layout analysis ☆268 · Mar 24, 2026 · Updated last month
- 1st-place solution for the ICDAR 2021 Competition on Mathematical Formula Detection (champion solution for formula detection) ☆134 · Sep 4, 2023 · Updated 2 years ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team… ☆1,829 · Mar 17, 2026 · Updated last month
- Formula recognition based on LaTeX-OCR and ONNXRuntime. ☆384 · Nov 3, 2024 · Updated last year
- A collection of OCR datasets, curated and converted to a unified annotation format for use in experiments ☆970 · Nov 28, 2023 · Updated 2 years ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition ☆469 · Sep 28, 2025 · Updated 7 months ago
- Enhanced math formula recognition: handwritten and printed formulas in Chinese and English, with support for elementary symbolic derivation (data structures based on the LaTeX abstract syntax tree). Math Formula OCR Pro, supports handwritten, Chinese-mixed formulas and simple symbol reaso… ☆1,295 · Jun 11, 2024 · Updated last year
- MXNet Gluon implementation of the PSENet text detector (Shape Robust Text Detection with Progressive Scale Expansion Network) ☆18 · Jun 17, 2019 · Updated 6 years ago
- This repo is used to release the ArxivFormula dataset. ☆35 · Nov 12, 2024 · Updated last year
- A faster LayoutReader model based on LayoutLMv3 that sorts OCR bounding boxes into reading order. ☆320 · Aug 15, 2025 · Updated 8 months ago
- The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023. ☆519 · Feb 1, 2026 · Updated 3 months ago
- A PyTorch-based OCR algorithm library, including psenet, pan, dbnet, sast, and crnn ☆680 · May 19, 2021 · Updated 4 years ago
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/… ☆76,903 · Apr 28, 2026 · Updated last week
- Correcting OCR recognition errors with a language model ☆474 · May 22, 2023 · Updated 2 years ago
- An inference library for sequence-based table recognition, integrating table recognition algorithms such as PP-Structure and modelscope. ☆415 · Apr 23, 2026 · Updated last week
- PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR) ☆1,167 · Sep 11, 2025 · Updated 7 months ago
- A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization". ☆2,252 · Mar 11, 2024 · Updated 2 years ago
- Generate text line images for training deep learning OCR models ☆910 · Apr 20, 2026 · Updated 2 weeks ago
- End-to-end layout analysis based on seq2seq ☆132 · Mar 8, 2021 · Updated 5 years ago
- Learning Generative Structure Prior for Blind Text Image Super-resolution [CVPR 2023] ☆264 · Aug 18, 2025 · Updated 8 months ago
- OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commer… ☆1,338 · Apr 23, 2026 · Updated last week
- Document Rectification and Illumination Correction using a Patch-based CNN ☆396 · Sep 28, 2022 · Updated 3 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆644 · Aug 12, 2024 · Updated last year
- TexTeller can convert images to LaTeX formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,… ☆738 · Aug 22, 2025 · Updated 8 months ago
- Collects the best currently open-source table recognition models, improves pre-processing and post-processing, and converts the models to ONNX ☆945 · Aug 3, 2025 · Updated 9 months ago