tesseract-ocr / tesstrainLinks
Train Tesseract LSTM with make
☆698Updated 5 months ago
Alternatives and similar repositories for tesstrain
Users that are interested in tesstrain are comparing it to the libraries listed below
Sorting:
- Library used to deskew a scanned document☆485Updated this week
- Best (most accurate) trained LSTM models.☆1,425Updated last year
- Line based ATR Engine based on OCRopy☆1,166Updated 4 months ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆79Updated 3 years ago
- OCR engine for all the languages☆889Updated last week
- ☆978Updated last year
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆1,467Updated last week
- An interactive document scanner built in Python using OpenCV featuring automatic corner detection, image sharpening, and color thresholdi…☆579Updated 2 years ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.☆521Updated 4 years ago
- OCR software for recognition of handwritten text☆817Updated 2 years ago
- Detect and fix skew in images containing text☆267Updated 6 years ago
- Fast integer versions of trained LSTM models☆567Updated last year
- Data used for LSTM model training☆122Updated last year
- This repository lets you train neural networks models for performing end-to-end full-page handwriting recognition using the Apache MXNet …☆518Updated 3 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆396Updated last year
- ☆146Updated 5 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆109Updated 2 years ago
- The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided.☆150Updated 3 years ago
- Handwritten Text Recognition using TensorFlow☆282Updated last year
- Extract tables from scanned image PDFs using Optical Character Recognition.☆276Updated 5 years ago
- A synthetic data generator for text recognition☆3,571Updated last year
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆218Updated 9 months ago
- Detect handwritten words (classic image processing based method).☆274Updated 2 years ago
- Dockerized example to train Tesseract v. 4☆64Updated 2 years ago
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆1,963Updated last week
- Python library to extract tabular data from images and scanned PDFs☆283Updated last year
- A simple document layout analysis using Python-OpenCV☆126Updated 5 years ago
- A pure pytorch implemented ocr project including text detection and recognition☆602Updated 3 years ago
- finetuned traineddata files for tesseract 4.0.0 for testing☆169Updated 6 years ago
- CORD: A Consolidated Receipt Dataset for Post-OCR Parsing☆445Updated 3 years ago