tesseract-ocr / tessdataLinks
Trained models with fast variant of the "best" LSTM models + legacy models
☆7,227Updated last year
Alternatives and similar repositories for tessdata
Users that are interested in tessdata are comparing it to the libraries listed below
Sorting:
- Source training data for Tesseract for lots of languages☆859Updated 7 months ago
- Best (most accurate) trained LSTM models.☆1,453Updated last year
- Tesseract Open Source OCR Engine (main repository)☆70,652Updated 3 weeks ago
- Tesseract Open Source OCR Engine (main repository)☆3,855Updated 3 weeks ago
- Fast integer versions of trained LSTM models☆576Updated last year
- Tesseract documentation☆2,206Updated last month
- Train Tesseract LSTM with make☆701Updated 6 months ago
- Links to awesome OCR projects☆3,060Updated last year
- Python-based tools for document analysis and OCR☆3,467Updated 4 years ago
- Box editor and trainer for Tesseract OCR☆247Updated 4 months ago
- Line based ATR Engine based on OCRopy☆1,167Updated 5 months ago
- Various documents related to Tesseract OCR☆266Updated 4 years ago
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,686Updated last month
- yolo3+ocr☆6,106Updated 3 years ago
- Fork of Tesseract Tools for Android☆3,774Updated 3 years ago
- finetuned traineddata files for tesseract 4.0.0 for testing☆169Updated 6 years ago
- 开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~☆2,818Updated 2 years ago
- A curated list of promising OCR resources☆1,692Updated 3 years ago
- A browser automation framework and ecosystem.☆33,563Updated last week
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,895Updated last year
- A Gtk/Qt front-end to tesseract-ocr.☆1,851Updated last month
- OCR engine for all the languages☆903Updated last week
- The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).☆3,391Updated this week
- C-OCR是携程自研的OCR项目,主要包括身份证、护照、火车票、签证等旅游相关证件、材料的识别。 项目包含4个部分,拒识、检测、识别、后处理。☆2,463Updated last year
- Free Offline OCR 离线的中文文本检测+识别SDK☆1,367Updated 11 months ago
- darknet text detect and darknet cnn ocr☆1,164Updated 4 years ago
- A standalone Java Decompiler GUI☆14,834Updated last year
- Apache JMeter open-source load testing tool for analyzing and measuring the performance of a variety of services☆9,077Updated this week
- Run your own OCR-as-a-Service using Tesseract and Docker☆1,369Updated 2 years ago
- ABBYY Cloud OCR SDK☆524Updated 2 years ago