gumblex / tessdata_chiLinks
Retrained Tesseract OCR model for Chinese
☆120Updated 3 years ago
Alternatives and similar repositories for tessdata_chi
Users that are interested in tessdata_chi are comparing it to the libraries listed below
Sorting:
- Python bindings for WPS Office RPC (for Linux)☆260Updated 5 months ago
- Based on RapidOCR, extract the PDF content☆182Updated 3 months ago
- OCR离线图片文字识别命令行windows程序,以JSON字符串形式输出结果,方便别的程序调用。基于 RapidOcrOnnx 。☆284Updated last year
- 🔎📖对中文PDF进行OCR | OCR for Chinese PDF file using API from DayBreak-u/chineseocr_lite☆105Updated last year
- 【间隙·树·排序算法】 对OCR结果或PDF提取的文本进行版面分析,按人类阅读顺序进行排序。☆152Updated last year
- CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包☆758Updated 2 months ago
- an open high-performance Optical Character Recognition (OCR) toolkit☆283Updated last month
- CnOCR 是 Python 3 下的文字识别(Optical Character Recognition,简称OCR)工具包,支持简体中文、繁体中文(部分模型)、英文和数字的常见字符识别,支持竖排文字的识别。自带了20+个训练好的识别模型,适用于不同应用场景,安装后即可直…☆45Updated last year
- chineseocr lite onnx,超轻量级中文ocr demo,支持onnx推理 ( dbnet+crnn+anglenet)☆139Updated 2 years ago
- A lunar calendar converter in Python, including 24 solar terms and a number of solar holidays and lunar holidays, mainly from China.☆79Updated last year
- pretrained models for cnocr☆56Updated 3 years ago
- Convert the model in PaddleOCR to ONNX format☆99Updated last month
- 文档方向分类☆223Updated 9 months ago
- 开源的中英文离线 OCR,使用 PaddleOCR 实现,提供了简单的 Web 页面及接口☆124Updated 3 years ago
- “Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.☆248Updated 2 years ago
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆360Updated this week
- 精选的中国开放文档格式(OFD)资源列表,包括标准规范、库、SDK、转换工具、阅读器和教程,为开发者和研究者提供全面参考。☆32Updated 11 months ago
- Anti OCR, Free Texts (拒绝被OCR,让文字得到自由)。把文本转换成机器无法识别但人可读的图片。☆53Updated 2 years ago
- 图片搜索引擎,很简单。三步构建属于你自己的图片搜索引擎,掌握向量数据库和以图搜图、文本搜索图片。☆145Updated last year
- 《现代汉语词典》(第7版)全文TXT☆282Updated last year
- 手写文字擦除第1名方案,水印智能消除赛第1名☆152Updated last year
- 文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSha…☆87Updated last year
- GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能☆178Updated last month
- ☆47Updated 6 years ago
- ☆389Updated last month
- Remove embedded watermarks and color stains for scanned PDF. 去除扫描版 PDF 中的水印☆183Updated 9 years ago
- HivisionIDPhotos的cpp实现手机端部署离线部署证件照程序☆70Updated 9 months ago
- 整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX Organize the currently open-source optimal table recognition models, improve pre-processing and post…☆801Updated 3 weeks ago
- Automatically exported from code.google.com/p/lingoes-extractor☆60Updated 9 years ago
- A simple way to deploy PaddleOCR based on FastAPI. (PaddleOCR 的 FastAPI 快速部署方案)☆140Updated 5 months ago