tesseract-ocr / tessdataLinks
Trained models with fast variant of the "best" LSTM models + legacy models
☆7,114Updated last year
Alternatives and similar repositories for tessdata
Users that are interested in tessdata are comparing it to the libraries listed below
Sorting:
- Best (most accurate) trained LSTM models.☆1,408Updated last year
- Source training data for Tesseract for lots of languages☆859Updated 5 months ago
- Tesseract Open Source OCR Engine (main repository)☆3,736Updated 2 months ago
- Tesseract Open Source OCR Engine (main repository)☆69,189Updated 3 weeks ago
- Fast integer versions of trained LSTM models☆563Updated last year
- Tesseract documentation☆2,152Updated 3 weeks ago
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆1,954Updated 3 weeks ago
- Java JNA wrapper for Tesseract OCR API☆1,700Updated last month
- Train Tesseract LSTM with make☆691Updated 4 months ago
- yolo3+ocr☆6,096Updated 3 years ago
- Python-based tools for document analysis and OCR☆3,463Updated 4 years ago
- Line based ATR Engine based on OCRopy☆1,159Updated 3 months ago
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,639Updated 2 months ago
- Links to awesome OCR projects☆3,037Updated last year
- 超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M☆12,192Updated 2 years ago
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.☆4,913Updated this week
- Free Offline OCR 离线的中文文本检测+识别SDK☆1,358Updated 9 months ago
- Various documents related to Tesseract OCR☆266Updated 3 years ago
- Box editor and trainer for Tesseract OCR☆245Updated 2 months ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆27,799Updated 11 months ago
- darknet text detect and darknet cnn ocr☆1,162Updated 3 years ago
- Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languag…☆53,341Updated this week
- ABBYY Cloud OCR SDK☆522Updated 2 years ago
- 开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~☆2,804Updated 2 years ago
- The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).☆3,185Updated this week
- mupdf mirror☆2,290Updated this week
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆1,463Updated last year
- A synthetic data generator for text recognition☆3,556Updated last year
- This is a sample Scrapy project for educational purposes☆1,340Updated last year
- Open source Python library for converting PDF to DOCX.☆3,078Updated 3 months ago