CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包
☆786Feb 7, 2026Updated 2 months ago
Alternatives and similar repositories for CnSTD
Users that are interested in CnSTD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,744Feb 7, 2026Updated 2 months ago
- Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集☆34Dec 21, 2022Updated 3 years ago
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them…☆3,087Feb 7, 2026Updated 2 months ago
- 基于cnstd+cnocr作为基础,封装的一个ocr的web服务☆10Nov 21, 2021Updated 4 years ago
- 基于Pytorch的OCR工具库,支持常用的文字检测和识别算法☆1,515Jan 4, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 文档方向分类☆221Feb 3, 2026Updated 2 months ago
- Free Offline OCR 离线的中文文本检测+识别SDK☆1,376Jan 12, 2026Updated 3 months ago
- CDLA: A Chinese document layout analysis (CDLA) dataset☆291Sep 13, 2021Updated 4 years ago
- TDF-ICDAR 2019 Dataset for Typeset Math Formula Detection☆69Feb 9, 2020Updated 6 years ago
- FormulaNet is a new large-scale Mathematical Formula Detection dataset.☆20Nov 21, 2022Updated 3 years ago
- yolo3+ocr☆6,119Aug 29, 2022Updated 3 years ago
- Scanning Single Shot Detector for Math in Document Images☆133Apr 18, 2023Updated 2 years ago
- 📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.☆6,288Updated this week
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,273Aug 14, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Analysis of Chinese and English layouts 中英文版面分析☆268Mar 24, 2026Updated 3 weeks ago
- 1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection(公式检测冠军方案)☆133Sep 4, 2023Updated 2 years ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,823Mar 17, 2026Updated 3 weeks ago
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆383Nov 3, 2024Updated last year
- 收集并整理有关OCR的数据集并统一标注格式,以便实验需要☆970Nov 28, 2023Updated 2 years ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆463Sep 28, 2025Updated 6 months ago
- 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro, supports handwrite, Chinese-mixed formulas and simple symbol reaso…☆1,292Jun 11, 2024Updated last year
- mxnet-Gluon implementation of PSENet text detector (Shape Robust Text Detection with Progressive Scale Expansion Network)☆18Jun 17, 2019Updated 6 years ago
- This repo is used to release the ArxivFormula dataset.☆35Nov 12, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆318Aug 15, 2025Updated 7 months ago
- The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.☆516Feb 1, 2026Updated 2 months ago
- 基于pytorch的ocr算法库,包括 psenet, pan, dbnet, sast , crnn☆679May 19, 2021Updated 4 years ago
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆75,347Apr 6, 2026Updated last week
- 利用语言模型,纠正OCR识别错误☆475May 22, 2023Updated 2 years ago
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆415Sep 4, 2025Updated 7 months ago
- PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)☆1,155Sep 11, 2025Updated 7 months ago
- A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".☆2,249Mar 11, 2024Updated 2 years ago
- Generate text line images for training deep learning OCR models☆905Mar 29, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- end2end layout analysis based seq2seq☆132Mar 8, 2021Updated 5 years ago
- OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commer…☆1,313Mar 2, 2026Updated last month
- Learning Generative Structure Prior for Blind Text Image Super-resolution [CVPR 2023]☆262Aug 18, 2025Updated 7 months ago
- Document Rectification and Illumination Correction using a Patch-based CNN☆395Sep 28, 2022Updated 3 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis☆641Aug 12, 2024Updated last year
- TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,…☆729Aug 22, 2025Updated 7 months ago
- OCR toolbox from Davar-Lab☆761Nov 16, 2023Updated 2 years ago