guiem / train-tesseract
Dockerized example to train Tesseract v. 4
☆64Updated last year
Related projects ⓘ
Alternatives and complementary repositories for train-tesseract
- Detect textlines in document images☆90Updated 5 months ago
- Detect handwritten words (neural network based).☆66Updated 2 years ago
- The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes…☆147Updated 5 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆77Updated 2 years ago
- Building OCR using YOLO and Tesseract☆91Updated 3 years ago
- Pretrained mixed models to be used with Calamari.☆58Updated last month
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆127Updated last week
- Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector☆253Updated 2 years ago
- Tutorial on how to deskew (straighten) text images☆50Updated 2 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 3 months ago
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer☆28Updated 3 years ago
- A TensorFlow implementation of hybird CNN-LSTM model with CTC loss for OCR problem☆33Updated 5 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆101Updated last year
- Train Tesseract LSTM with make☆639Updated 5 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆180Updated last week
- Master repository which includes most other OCR-D repositories as submodules☆72Updated last month
- Document Scanner and Word Segmentation☆119Updated 4 years ago
- OCR & Ground Truth Resources☆74Updated 2 years ago
- The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided.☆146Updated 3 years ago
- OCR-D-compliant page segmentation☆67Updated 2 months ago
- Document Layout Analysis☆350Updated this week
- Working with hOCR in Javascript☆122Updated last year
- Detect handwritten words (classic image processing based method).☆265Updated last year
- ☆16Updated 3 years ago
- Page to PAGE Layout Analysis Tool☆191Updated 2 years ago
- Handwritten text recognition using transformers.☆154Updated 3 months ago
- python ocr using tesseract/ with EAST opencv detector☆42Updated 4 months ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆35Updated 11 months ago
- This repository is to create tflite models for the available ocr models☆101Updated 3 years ago
- A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV☆118Updated 2 years ago