ricktorzynski / ocr-tesseract-docker
OCR using Python, Tesseract and OpenCV in a Docker container
☆124Updated 2 years ago
Alternatives and similar repositories for ocr-tesseract-docker
Users who are interested in ocr-tesseract-docker are comparing it to the libraries listed below.
- A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV☆120Updated 3 years ago
- Dockerized example to train Tesseract v. 4☆64Updated 2 years ago
- Python program to recognize Text from Images using Google's tesseract-ocr☆28Updated 2 years ago
- BFSI sectors deal with lots of unstructured scanned documents which are archived in document management systems for further use. For examp…☆40Updated 3 years ago
- Library used to deskew a scanned document☆461Updated 2 weeks ago
- Python library to extract tabular data from images and scanned PDFs☆278Updated 9 months ago
- Extraction of machine-readable zone information from passports, visas and id-cards via OCR☆402Updated 2 months ago
- ☆31Updated last year
- Train Tesseract LSTM with make☆675Updated 3 weeks ago
- Ready-to-use MRZ / MRTD (Machine-readable zone/travel documents) dataset and models for tesseract v4☆97Updated 5 years ago
- The module extracts text from images using the tesseract-OCR engine. Generally, text present in the images is blurred or of uneven sizes…☆147Updated 6 years ago
- Perspective recovery of text using transformed ellipses☆150Updated 3 years ago
- Docker Image with latest Tesseract OCR Version 5.x.x built from sources☆37Updated 2 weeks ago
- NanoNets OCR API Example for Python☆185Updated 3 years ago
- Code and procedures for handwriting object detection and recognition☆79Updated 4 years ago
- Tesseract 4 OCR Compilation - Docker Container☆54Updated 3 years ago
- BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on sc…☆109Updated 2 years ago
- Tutorial on how to deskew (straighten) text images☆51Updated 3 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆391Updated 9 months ago
- This Repository contains a Jupyter notebook explaining how to detect checkboxes/table cells from a scanned image☆31Updated 4 years ago
- Python interface to Apache PDFBox command-line tools.☆75Updated 2 years ago
- tesseract 4 traineddata for MRZ using OCR-B fonts☆79Updated 5 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition.☆273Updated 4 years ago
- Detecting the National Identification Cards with Deep Learning (Faster R-CNN)☆303Updated 2 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆104Updated 2 years ago
- Text Detection from images using OpenCV☆109Updated 4 years ago
- ☆13Updated 3 years ago
- A simple document detector in python3☆51Updated 2 years ago
- Tensorflow, Luminoth Based Table Detection and Extraction☆163Updated 2 years ago
- sambalshikhar / Document-Image-Classification-with-Intra-Domain-Transfer-Learning-and-Stacked-Generalization-of-Deep: RVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the …☆18Updated 5 years ago