gumblex / tessdata_chi
Retrained Tesseract OCR model for Chinese
☆133, updated 3 years ago
Alternatives and similar repositories for tessdata_chi
Users who are interested in tessdata_chi are comparing it to the libraries listed below.
- Python bindings for WPS Office RPC (for Linux) ☆275, updated 8 months ago
- Based on RapidOCR, extract the PDF content ☆184, updated 7 months ago
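- 🔎📖 OCR for Chinese PDF files using the API from DayBreak-u/chineseocr_lite ☆110, updated last year
- CnSTD: a Python 3 package based on PyTorch/MXNet for Chinese/English scene text detection (STD), mathematical formula detection (MFD), and layout analysis ☆771, updated 5 months ago
- [Gap · Tree · Sorting Algorithm] Performs layout analysis on OCR results or text extracted from PDFs and sorts it in human reading order. ☆162, updated last year
- Offline OCR command-line program for Windows that recognizes text in images and outputs results as JSON strings so that other programs can call it easily. Based on RapidOcrOnnx. ☆312, updated last year
- First-place solution for handwritten text erasure; first place in the intelligent watermark removal competition ☆170, updated last year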
- an open high-performance Optical Character Recognition (OCR) toolkit ☆304, updated 4 months ago
- Automatically exported from code.google.com/p/lingoes-extractor ☆60, updated 9 years ago
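- Open-source offline OCR for Chinese and English, implemented with PaddleOCR, with a simple web page and API ☆131, updated 3 years ago
- A curated list of resources for China's open document format (OFD), including standards and specifications, libraries, SDKs, conversion tools, readers, and tutorials, offering a comprehensive reference for developers and researchers. ☆39, updated last year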
- Convert the model in PaddleOCR to ONNX format ☆108, updated 4 months ago
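- Inference library for sequence-based table recognition, integrating table recognition algorithms such as PP-Structure and modelscope. ☆396, updated 3 months ago
- Anti OCR, Free Texts. Converts text into images that humans can read but machines cannot recognize. ☆53, updated 3 years ago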
- A lunar calendar converter in Python, including 24 solar terms and a number of solar holidays and lunar holidays, mainly from China. ☆81, updated last year
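- A simple mdx dictionary supporting Chinese and English ☆65, updated 2 months ago
- A simple way to deploy PaddleOCR based on FastAPI. ☆158, updated 2 months ago
- CnOCR is an optical character recognition (OCR) toolkit for Python 3 that recognizes common characters in Simplified Chinese, Traditional Chinese (some models), English, and digits, and supports vertical text. It ships with 20+ pretrained recognition models for different application scenarios, and after installation can be directly… ☆49, updated last year
- Full text of the Contemporary Chinese Dictionary (《现代汉语词典》, 7th edition) in TXT format ☆294, updated last year
- An image search engine, made simple. Build your own image search engine in three steps and learn about vector databases, image-to-image search, and text-to-image search. ☆147, updated 2 years ago
- Multi-format document conversion system (Word/Excel/PPT to PDF/OFD, and conversion between PDF and OFD) ☆17, updated last year
- AI-OCR is an OCR desktop client based on PaddleOCR that supports Windows, Linux, macOS, and other operating systems. Technical architecture: front-end UI: Electron + Reactjs + ArcoDesign; OCR engine: PaddleOCR, packaged with Pyinstaller; front end and O… ☆27, updated 3 years ago
- GUI version of GOT-OCR, offering OCR, PDF export, batch processing, and other features, but no training functionality ☆180, updated last month
- Document image processing tool, including bleaching / text orientation correction / sharpness enhancement / note denoising and beautification / shadow removal / dewarping / edge trimming and enhancement (DocBleach / TextOrientationCorrection / DocSha… ☆112, updated last year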
- “Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice. ☆252, updated 2 years ago
- ☆400, updated 4 months ago
- Document orientation classification ☆225, updated last year
- A Node.js library based on paddleOCR ☆101, updated 3 months ago
- PPOCRLabel is a semi-automatic graphic annotation tool suited to the OCR field, with a built-in PP-OCR model to automatically detect and re-… ☆214, updated 2 years ago
- Pretrained models for cnocr ☆56, updated 4 years ago