tesseract-ocr / tessdata_fast
Fast integer versions of trained LSTM models
☆503 · Updated 5 months ago
Alternatives and similar repositories for tessdata_fast:
Users who are interested in tessdata_fast are comparing it to the repositories listed below.
- Best (most accurate) trained LSTM models. ☆1,276 · Updated 10 months ago
- Train Tesseract LSTM with make ☆653 · Updated 7 months ago
- Source training data for Tesseract for lots of languages ☆845 · Updated 10 months ago
- Box editor and trainer for Tesseract OCR ☆234 · Updated 6 months ago
- Various documents related to Tesseract OCR ☆263 · Updated 3 years ago
- Line based ATR Engine based on OCRopy ☆1,065 · Updated 2 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆375 · Updated 5 months ago
- Tesseract documentation ☆1,901 · Updated last month
- Data used for LSTM model training ☆116 · Updated 10 months ago
- Pre-Recognize Library - library with algorithms for improving OCR quality. ☆104 · Updated last year
- Tesseract Open Source OCR Engine (main repository) ☆3,257 · Updated last month
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆79 · Updated 2 years ago
- OCR engine for all the languages ☆767 · Updated this week
- finetuned traineddata files for tesseract 4.0.0 for testing ☆159 · Updated 5 years ago
- ☆940 · Updated 2 years ago
- TableBank: A Benchmark Dataset for Table Detection and Recognition ☆1,028 · Updated 5 months ago
- Library used to deskew a scanned document ☆434 · Updated last week
- Links to awesome OCR projects ☆2,866 · Updated 6 months ago
- ABBYY Cloud OCR SDK ☆504 · Updated last year
- 📰 Binary distribution of PDFium ☆950 · Updated this week
- Extract tables from scanned image PDFs using Optical Character Recognition. ☆271 · Updated 4 years ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells. ☆511 · Updated 3 years ago
- Document Layout Analysis resources repos for development with PdfPig. ☆598 · Updated last year
- ☆423 · Updated 2 years ago
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model. ☆1,415 · Updated 5 months ago
- Repository collecting all the submodules for the new PyTorch-based OCR System. ☆141 · Updated 3 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆592 · Updated 5 months ago
- Dockerized example to train Tesseract v. 4 ☆64 · Updated 2 years ago
- A Gtk/Qt front-end to tesseract-ocr. ☆1,673 · Updated last week
- Retrained Tesseract OCR model for Chinese ☆99 · Updated 2 years ago