ricktorzynski / ocr-tesseract-docker
OCR using Python, Tesseract and OpenCV in a Docker container
☆123 Updated last year
Related projects
Alternatives and complementary repositories for ocr-tesseract-docker
- Dockerized example to train Tesseract v. 4 ☆64 Updated last year
- Train Tesseract LSTM with make ☆639 Updated 5 months ago
- Tesseract 4 OCR Runtime Environment - Docker Container ☆97 Updated 5 years ago
- The module extracts text from images using the Tesseract OCR engine. Generally, text present in the images is blurred or of uneven sizes… ☆147 Updated 5 years ago
- Library used to deskew a scanned document ☆418 Updated last month
- Python library to extract tabular data from images and scanned PDFs ☆264 Updated 3 months ago
- Table Detection and Extraction Using Deep Learning (built in Python, using Luminoth, TensorFlow<2.0 and Sonnet) ☆198 Updated last year
- A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV ☆118 Updated 2 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆370 Updated 3 months ago
- Docker Image with latest Tesseract OCR Version 5.x.x built from sources ☆30 Updated last week
- Google Colab Demo of CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents ☆46 Updated 3 years ago
- Building OCR using YOLO and Tesseract ☆91 Updated 3 years ago
- Detect and read handwritten words on scanned pages. ☆106 Updated last year
- Document image dewarping library using a cubic sheet model ☆117 Updated this week
- A package for signature detection ☆50 Updated 2 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality. ☆101 Updated last year
- Demos, examples and utilities using PyMuPDF ☆578 Updated 4 months ago
- A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved. ☆434 Updated last year
- ☆46 Updated 9 months ago
- Tutorial on how to deskew (straighten) text images ☆50 Updated 2 years ago
- Tesseract 4 OCR Compilation - Docker Container ☆53 Updated 2 years ago
- Document Layout Analysis ☆350 Updated this week
- BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on sc… ☆105 Updated last year
- Tesseract 4 traineddata for recognizing Seven Segment Display ☆51 Updated 5 years ago
- Extract tables from scanned documents pdf into csv file using ocr and image processing ☆129 Updated 5 years ago
- Locate and extract tables and figures in PDFs ☆42 Updated 3 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆180 Updated last week
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation ☆127 Updated last week
- OCR with Google's AI technology (Cloud Vision API) ☆68 Updated last year
- Deploying a basic application on GCP, AWS and Azure ☆59 Updated last year