gumblex / tessdata_chi
Retrained Tesseract OCR model for Chinese
☆134 Updated 3 years ago
Alternatives and similar repositories for tessdata_chi
Users who are interested in tessdata_chi are comparing it to the libraries listed below.
- Python bindings for WPS Office RPC (for Linux) ☆277 Updated 9 months ago
- [Gap-Tree Sorting Algorithm] Performs layout analysis on OCR results or text extracted from PDFs and sorts it in human reading order. ☆163 Updated last year
- Extract PDF content, based on RapidOCR ☆184 Updated 7 months ago
- 🔎📖 OCR for Chinese PDF files using the API from DayBreak-u/chineseocr_lite ☆110 Updated last year
- An open high-performance Optical Character Recognition (OCR) toolkit ☆305 Updated 5 months ago
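- CnSTD: a Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis ☆773 Updated 6 months ago
- Offline OCR command-line program for Windows that recognizes text in images and outputs the results as a JSON string, so that other programs can call it easily. Based on RapidOcrOnnx. ☆313 Updated 2 years ago
- Inference library for sequence-based table recognition algorithms, integrating table recognition algorithms such as PP-Structure and modelscope. ☆400 Updated 3 months ago
- Open-source offline OCR for Chinese and English, implemented with PaddleOCR, providing a simple web page and API ☆131 Updated 3 years ago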
- Fast integer versions of trained LSTM models ☆586 Updated last year
- Pretrained models for cnocr ☆57 Updated 4 years ago
- A lunar calendar converter in Python, including 24 solar terms and a number of solar holidays and lunar holidays, mainly from China. ☆82 Updated last year
- Document orientation classification ☆224 Updated last year
- “Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice. ☆253 Updated 2 years ago
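- Convert PDF to Markdown with PaddleOCR ☆78 Updated last year
- Document image processing tool, including bleaching / text orientation correction / sharpness enhancement / note denoising and beautification / shadow removal / distortion correction / edge-trimming enhancement (DocBleach / TextOrientationCorrection / DocSha… ☆116 Updated last year
- First-place solution for handwritten text erasure; first place in the intelligent watermark removal competition ☆175 Updated last year
- Full text of the 《现代汉语词典》 (Contemporary Chinese Dictionary, 7th edition) in TXT format ☆298 Updated last year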
- Formula recognition based on LaTeX-OCR and ONNXRuntime. ☆373 Updated last year
- Analysis of Chinese and English layouts ☆259 Updated 4 months ago
- Convert the model in PaddleOCR to ONNX format ☆112 Updated 5 months ago
- C++ implementation of the Shouxing astronomical calendar (寿星天文历) ☆240 Updated last year
- A simple image search engine. Build your own image search engine in three steps and learn about vector databases, image-to-image search, and text-to-image search. ☆148 Updated 2 years ago
- OCR-based automated exam grading project ☆403 Updated 3 months ago
- Collects the best currently open-source table recognition models, improves pre-processing and post-processing, and converts the models to ONNX ☆906 Updated 5 months ago
- Common Chinese fonts for personal use in Linux environments. ☆450 Updated 3 years ago
- ☆48 Updated 6 years ago
- ☆404 Updated 5 months ago
- A simple way to deploy PaddleOCR based on FastAPI. ☆159 Updated 2 months ago
- The latest SQLite version of the China Biographical Database ☆146 Updated last month