yardstick17 / image_text_reader
The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. This module first makes bounding box for text in images and then normalizes it to 300 dpi, suitable for OCR engine to read.
☆147Updated 6 years ago
Alternatives and similar repositories for image_text_reader:
Users that are interested in image_text_reader are comparing it to the libraries listed below
- Detect handwritten words (classic image processing based method).☆272Updated last year
- Detect handwritten words (neural network based).☆69Updated 3 years ago
- Detect and fix skew in images containing text☆264Updated 6 years ago
- Recognize handwritten text in scanned documents using MultiDimensional Recurrent Neural Networks☆46Updated 6 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆185Updated 4 months ago
- A simple program to extract the text from an image before performing OCR☆222Updated 4 years ago
- Code and procdures for handwriting object detection and recognition☆79Updated 4 years ago
- Line-level Handwritten Text Recognition (HTR) system implemented with TensorFlow.☆75Updated 2 years ago
- ☆16Updated 4 years ago
- A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV☆119Updated 3 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆80Updated 3 years ago
- Page to PAGE Layout Analysis Tool☆191Updated 3 years ago
- Document Scanner and Word Segmentation☆123Updated 4 years ago
- Text detection with mainly MSER and SWT☆200Updated 5 months ago
- A simple document layout analysis using Python-OpenCV☆124Updated 4 years ago
- How to train a Keras model to recognize variable length text | DLology☆70Updated 6 years ago
- Retrive meaningful information from PAN Card image using tesseract-ocr☆124Updated 7 months ago
- Python scripts for segmentation of cursive handwritten image, and recognizing the characters using a CNN based model☆70Updated 5 years ago
- Using Tensorflow to classify the NIST Dataset 19 (Handwriting)☆41Updated 6 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆104Updated last year
- python ocr using tesseract/ with EAST opencv detector☆42Updated 9 months ago
- This a Deep learning AI system which recognize handwritten characters, Here I use chars74k data-set for training the model☆63Updated 7 years ago
- An application of high resolution GANs to dewarp images of perturbed documents☆140Updated 3 years ago
- The goal of this project is to solve the task of name transcription from handwriting images implementing a NN approach.☆58Updated 7 years ago
- Template based form extractor OCR. Train your own character and alphabet OCR.☆18Updated 6 years ago
- document image degradation☆163Updated 4 years ago
- Detect textlines in document images☆92Updated 10 months ago
- Extract text information from Aadhaar Card using tesseract-ocr☆131Updated 2 years ago
- Perspective recovery of text using transformed ellipses☆150Updated 3 years ago
- TensorFlow implementation of a segmentation system for document images.☆34Updated 6 years ago