guiem / train-tesseract
Dockerized example to train Tesseract v. 4
☆64Updated 2 years ago
Alternatives and similar repositories for train-tesseract
Users that are interested in train-tesseract are comparing it to the libraries listed below
Sorting:
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆80Updated 3 years ago
- Detect handwritten words (neural network based).☆70Updated 3 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆80Updated 2 years ago
- Detect textlines in document images☆93Updated 11 months ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆202Updated 4 months ago
- Pytorch implementation of our paper: Adapting OCR with Limited Labels☆60Updated last year
- A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV☆120Updated 3 years ago
- CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)☆157Updated 2 years ago
- Train Tesseract LSTM with make☆675Updated 3 weeks ago
- Library used to deskew a scanned document☆461Updated 2 weeks ago
- Model for document segmentation trained on the midv-500-models dataset.☆76Updated 4 years ago
- Detect handwritten words (classic image processing based method).☆273Updated 2 years ago
- Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector☆264Updated 3 years ago
- This repository is to create tflite models for the available ocr models☆104Updated 4 years ago
- The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided.☆150Updated 3 years ago
- Tutorial on how to deskew (straighten) text images☆51Updated 3 years ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆136Updated this week
- Document Scanner and Word Segmentation☆124Updated 4 years ago
- ☆138Updated last year
- Download and convert MIDV-500 annotations to COCO instance segmentation format☆88Updated 4 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆185Updated 5 months ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆104Updated 2 years ago
- Text Detection from images using OpenCV☆109Updated 4 years ago
- Detect and read handwritten words on scanned pages.☆119Updated last year
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer☆28Updated 3 years ago
- Building OCR using YOLO and Tesseract☆94Updated 3 years ago
- Handwritten text recognition using transformers.☆158Updated 9 months ago
- BFSI sectors deal with lots of unstructured scanned documents which are archived in document management systems for further use.For examp…☆40Updated 3 years ago
- sambalshikhar / Document-Image-Classification-with-Intra-Domain-Transfer-Learning-and-Stacked-Generalization-of-DeepRVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the …☆18Updated 5 years ago
- Real-time detection of documents in images☆81Updated 8 months ago