tesseract-ocr / tessdataLinks
Trained models with fast variant of the "best" LSTM models + legacy models
☆7,338Updated last year
Alternatives and similar repositories for tessdata
Users that are interested in tessdata are comparing it to the libraries listed below
Sorting:
- Source training data for Tesseract for lots of languages☆865Updated 9 months ago
- Best (most accurate) trained LSTM models.☆1,482Updated last year
- Tesseract documentation☆2,265Updated this week
- Tesseract Open Source OCR Engine (main repository)☆71,783Updated this week
- Fast integer versions of trained LSTM models☆587Updated last year
- Tesseract Open Source OCR Engine (main repository)☆3,991Updated 3 months ago
- A Python wrapper for Google Tesseract☆6,293Updated 2 weeks ago
- Train Tesseract LSTM with make☆709Updated 8 months ago
- A Python wrapper for the tesseract-ocr API☆2,140Updated this week
- Links to awesome OCR projects☆3,078Updated last year
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆28,700Updated last month
- Various documents related to Tesseract OCR☆267Updated 4 years ago
- Python-based tools for document analysis and OCR☆3,467Updated 4 years ago
- 开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~☆2,847Updated 2 years ago
- yolo3+ocr☆6,117Updated 3 years ago
- Line based ATR Engine based on OCRopy☆1,179Updated 7 months ago
- Clone of the mercurial repository http://zbar.hg.sourceforge.net:8000/hgroot/zbar/zbar☆2,537Updated last year
- Free Offline OCR 离线的中文文本检测+识别SDK☆1,372Updated last month
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,929Updated last year
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆67,509Updated this week
- make a better chinese character recognition OCR than tesseract☆1,514Updated 8 years ago
- UI Automation Framework for Games and Apps☆9,059Updated last month
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,254Updated 2 years ago
- 验证码识别☆2,800Updated 3 years ago
- A Gtk/Qt front-end to tesseract-ocr.☆1,895Updated 2 weeks ago
- The open source embeddable online markdown editor (component).☆14,301Updated last year
- OCR engine for all the languages☆928Updated 3 weeks ago
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,711Updated 3 months ago
- Convert HTML to PDF using Webkit (QtWebKit)☆14,488Updated 3 years ago
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆8,785Updated this week