tesseract-ocr / tessdataLinks
Trained models with fast variant of the "best" LSTM models + legacy models
☆7,329Updated last year
Alternatives and similar repositories for tessdata
Users that are interested in tessdata are comparing it to the libraries listed below
Sorting:
- Best (most accurate) trained LSTM models.☆1,482Updated last year
- Source training data for Tesseract for lots of languages☆863Updated 9 months ago
- Tesseract Open Source OCR Engine (main repository)☆3,991Updated 3 months ago
- Tesseract documentation☆2,265Updated this week
- Tesseract Open Source OCR Engine (main repository)☆71,783Updated this week
- Fast integer versions of trained LSTM models☆587Updated last year
- Java JNA wrapper for Tesseract OCR API☆1,724Updated last week
- yolo3+ocr☆6,117Updated 3 years ago
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,711Updated 3 months ago
- Python-based tools for document analysis and OCR☆3,467Updated 4 years ago
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,252Updated 2 years ago
- Links to awesome OCR projects☆3,078Updated last year
- Line based ATR Engine based on OCRopy☆1,178Updated 7 months ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,929Updated last year
- Train Tesseract LSTM with make☆709Updated 8 months ago
- Fork of Tesseract Tools for Android☆3,776Updated 3 years ago
- 开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~☆2,845Updated 2 years ago
- Free Offline OCR 离线的中文文本检测+识别SDK☆1,372Updated last month
- Various documents related to Tesseract OCR☆267Updated 4 years ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆28,700Updated last month
- Box editor and trainer for Tesseract OCR☆249Updated 3 weeks ago
- darknet text detect and darknet cnn ocr☆1,161Updated 4 years ago
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆67,509Updated this week
- ABBYY Cloud OCR SDK☆528Updated 2 years ago
- The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).☆3,492Updated last week
- Experimental optical character recognition app☆2,240Updated 7 years ago
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.☆5,579Updated this week
- make a better chinese character recognition OCR than tesseract☆1,514Updated 8 years ago
- A synthetic data generator for text recognition☆3,623Updated last year
- Collaboration with wangxupeng(https://github.com/wangxupeng)☆1,957Updated last year