tesseract-ocr / tessdataLinks
Trained models with fast variant of the "best" LSTM models + legacy models
☆7,062Updated last year
Alternatives and similar repositories for tessdata
Users that are interested in tessdata are comparing it to the libraries listed below
Sorting:
- Best (most accurate) trained LSTM models.☆1,381Updated last year
- Tesseract Open Source OCR Engine (main repository)☆3,660Updated 3 weeks ago
- Source training data for Tesseract for lots of languages☆857Updated 3 months ago
- Tesseract Open Source OCR Engine (main repository)☆68,407Updated 3 weeks ago
- Tesseract documentation☆2,106Updated last month
- Fast integer versions of trained LSTM models☆557Updated 11 months ago
- A Python wrapper for Google Tesseract☆6,179Updated last month
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆1,939Updated last week
- Java JNA wrapper for Tesseract OCR API☆1,682Updated last week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆27,362Updated 10 months ago
- A Python wrapper for the tesseract-ocr API☆2,109Updated 2 months ago
- Train Tesseract LSTM with make☆687Updated 3 months ago
- A framework like Celery!☆2Updated 2 years ago
- Box editor and trainer for Tesseract OCR☆244Updated last month
- Links to awesome OCR projects☆3,017Updated last year
- Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.☆858Updated last month
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,601Updated last month
- Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languag…☆51,849Updated last week
- yolo3+ocr☆6,090Updated 2 years ago
- Various documents related to Tesseract OCR☆265Updated 3 years ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,834Updated last year
- Download Poppler binaries packaged for Windows with dependencies☆872Updated 7 months ago
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.☆4,672Updated this week
- ABBYY Cloud OCR SDK☆518Updated 2 years ago
- Free Offline OCR 离线的中文文本检测+识别SDK☆1,351Updated 8 months ago
- Community maintained fork of pdfminer - we fathom PDF☆6,601Updated 2 months ago
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,174Updated last year
- 开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~☆2,793Updated 2 years ago
- [DEPRECATED] Core Java Library + PDF/A, xtra and XML Worker. Only security fixes will be added — please use iText 7☆1,656Updated 9 months ago
- Open source Python library for converting PDF to DOCX.☆3,032Updated 2 months ago