gumblex / tessdata_chiLinks
Retrained Tesseract OCR model for Chinese
☆134Updated 3 years ago
Alternatives and similar repositories for tessdata_chi
Users that are interested in tessdata_chi are comparing it to the libraries listed below
Sorting:
- Python bindings for WPS Office RPC (for Linux)☆281Updated 9 months ago
- Based on RapidOCR, extract the PDF content☆184Updated 8 months ago
- OCR离线图片文字识别命令行windows程序,以JSON字符串形式输出结果,方便别的程序调用。基于 RapidOcrOnnx 。☆317Updated 2 years ago
- 🔎📖对中文PDF进行OCR | OCR for Chinese PDF file using API from DayBreak-u/chineseocr_lite☆112Updated last year
- an open high-performance Optical Character Recognition (OCR) toolkit☆305Updated 6 months ago
- GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能☆181Updated 2 months ago
- 手写文字擦除第1名方案,水印智能消除赛第1名☆178Updated last year
- 【间隙·树·排序算法】 对OCR结果或PDF提取的文本进行版面分析,按人类阅读顺序进行排序。☆163Updated last year
- CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包☆777Updated 6 months ago
- CnOCR 是 Python 3 下的文字识别(Optical Character Recognition,简称OCR)工具包,支持简体中文、繁体中文(部分模型)、英文和数字的常见字符识别,支持竖排文字的识别。自带了20+个训练好的识别模型,适用于不同应用场景,安装后即可直…☆51Updated last year
- 开源的中英文离线 OCR,使用 PaddleOCR 实现,提供了简单的 Web 页面及接口☆130Updated 3 years ago
- Convert the model in PaddleOCR to ONNX format☆112Updated 6 months ago
- Fast integer versions of trained LSTM models☆593Updated last year
- 文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSha…☆117Updated last year
- 图片搜索引擎,很简单。三步构建属于你自己的图片搜索引擎,掌握向量数据库和以图搜图、文本搜索图片。☆150Updated 2 years ago
- A lunar calendar converter in Python, including 24 solar terms and a number of solar holidays and lunar holidays, mainly from China.☆82Updated last year
- FastAPI PaddleSpeech 音频录音转文字☆51Updated last year
- chineseocr lite onnx,超轻量级中文ocr demo,支持onnx推理 ( dbnet+crnn+anglenet)☆148Updated 2 years ago
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆407Updated 4 months ago
- 文档方向分类☆224Updated last year
- 精选的中国开放文档格式(OFD)资源列表,包括标准规范、库、SDK、转换工具、阅读器和教程,为开发者和研究者提供全面参考。☆44Updated last year
- “Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.☆254Updated 3 years ago
- Analysis of Chinese and English layouts 中英文版面分析☆259Updated 5 months ago
- A high-performance, open-source PDF data extraction tool. 一站式开源高性能数据提取工具,将复杂 PDF 文档转换为 Markdown 和 JSON 格式,使用onnx模型。☆94Updated 3 weeks ago
- pretrained models for cnocr☆58Updated 4 years ago
- Automatically exported from code.google.com/p/lingoes-extractor☆60Updated 9 years ago
- core for Final2x☆92Updated 3 months ago
- OCR自动化阅卷项目☆415Updated 4 months ago
- Phi3 中文后训练模型仓库☆324Updated last year
- 寿星天文历的C++实现版本☆239Updated last year