guiem / train-tesseract
Dockerized example to train Tesseract v. 4
☆64 Updated 2 years ago
Alternatives and similar repositories for train-tesseract:
Users who are interested in train-tesseract are comparing it to the repositories listed below:
- Tutorial on how to deskew (straighten) text images ☆51 Updated 2 years ago
- Detect handwritten words (neural network based). ☆67 Updated 2 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆80 Updated 3 years ago
- A TensorFlow implementation of a hybrid CNN-LSTM model with CTC loss for the OCR problem ☆32 Updated 5 years ago
- Document Scanner and Word Segmentation ☆122 Updated 4 years ago
- Detect textlines in document images ☆91 Updated 8 months ago
- Library used to deskew a scanned document ☆438 Updated last week
- Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector ☆258 Updated 2 years ago
- Train Tesseract LSTM with make ☆655 Updated 8 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation ☆133 Updated last month
- Implemented the YOLO algorithm for scene text detection in keras-tensorflow (No object detection API used) The code can be tweaked to tra… ☆153 Updated 2 years ago
- The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided. ☆149 Updated 3 years ago
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer ☆27 Updated 3 years ago
- Detecting the National Identification Cards with Deep Learning (Faster R-CNN) ☆298 Updated 2 years ago
- The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes… ☆146 Updated 5 years ago
- Handwritten text recognition using transformers. ☆155 Updated 6 months ago
- Building OCR using YOLO and Tesseract ☆93 Updated 3 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆184 Updated 2 months ago
- Pre-Recognize Library - library with algorithms for improving OCR quality. ☆104 Updated last year
- BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on sc… ☆107 Updated 2 years ago
- CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor) ☆157 Updated 2 years ago
- Code and procedures for handwriting object detection and recognition ☆79 Updated 4 years ago
- Data used for LSTM model training ☆116 Updated 11 months ago
- A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV ☆118 Updated 2 years ago
- CVPR 2022: Table Structure Recognition ☆39 Updated 2 years ago
- Tool for enhancing noisy scanned text images ☆48 Updated 5 years ago
- Document Layout Analysis ☆359 Updated last month
- Perspective recovery of text using transformed ellipses ☆149 Updated 3 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION ☆79 Updated last year