gumblex / tessdata_chi
Retrained Tesseract OCR model for Chinese
☆103Updated 2 years ago
Alternatives and similar repositories for tessdata_chi:
Users that are interested in tessdata_chi are comparing it to the libraries listed below
- Based on RapidOCR, extract the PDF content.☆143Updated 5 months ago
- Convert the model in PaddleOCR to ONNX format☆75Updated 5 months ago
- 【间隙·树·排序算法】 对OCR结果或PDF提取的文本进行版面分析,按人类阅读顺序进行排序。☆121Updated 11 months ago
- Python bindings for WPS Office RPC (for Linux)☆233Updated 3 months ago
- 手写文字擦除第1名方案,水印智能消除赛第1名☆126Updated 9 months ago
- OCR离线图片文字识别命令行windows程序,以JSON字符串形式输出结果,方便别的程序调用。基于 RapidOcrOnnx 。☆223Updated last year
- 《通用规范汉字表》是由中华人民共和国教育部、国家语言文字工作委员会联合组织研制的汉字使用规范, 2013年6月5日正式颁布,成为社会一般应用领域的汉字规范.☆56Updated 3 months ago
- 文档方向分类☆212Updated 3 months ago
- CDLA: A Chinese document layout analysis (CDLA) dataset☆258Updated 3 years ago
- ☆57Updated 7 years ago
- k2pdfopt library for koreader, based on http://willus.com/k2pdfopt☆96Updated 2 months ago
- ☆274Updated 2 months ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆156Updated 10 months ago
- OCR pre-processing Toolbox☆16Updated 2 years ago
- “Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.☆232Updated 2 years ago
- Fast integer versions of trained LSTM models☆513Updated 6 months ago
- PPOCRLabel is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and re-…☆201Updated last year
- CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包☆712Updated last month
- A simple way to deploy PaddleOCR based on FastAPI. (PaddleOCR 的 FastAPI 快速部署方案)☆109Updated this week
- PDF OCR Application, adds an OCR text layer to scanned PDF files, allowing them to be copied and searched.☆56Updated last year
- 开源的中英文离线 OCR,使用 PaddleOCR 实现,提供了简单的 Web 页面及接口☆118Updated 2 years ago
- MDict pack/unpack/list/info tool☆321Updated last month
- 《现代汉语词典》(第7版)全文TXT☆259Updated 7 months ago
- transformers ocr for chinese☆374Updated 2 years ago
- ☆43Updated 5 years ago
- 📰 Binary distribution of PDFium☆969Updated this week
- core for Final2x☆83Updated 2 months ago
- 基于transformer的ocr识别,在公章(印章识别, seal recognition)拓展应用☆179Updated 8 months ago
- Anti OCR, Free Texts (拒绝被OCR,让文字得到自由)。把文本转换成机器无法识别但人可读的图片。☆49Updated 2 years ago
- ☆106Updated 6 years ago