ricktorzynski / ocr-tesseract-dockerLinks
OCR using Python, Tesseract and OpenCV in a Docker container
☆125Updated 2 years ago
Alternatives and similar repositories for ocr-tesseract-docker
Users that are interested in ocr-tesseract-docker are comparing it to the libraries listed below
Sorting:
- Dockerized example to train Tesseract v. 4☆64Updated 3 years ago
- Train Tesseract LSTM with make☆705Updated 7 months ago
- Tesseract 4 OCR Runtime Environment - Docker Container☆101Updated 6 years ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.☆523Updated 4 years ago
- Python library to extract tabular data from images and scanned PDFs☆286Updated last year
- Extraction of machine-readable zone information from passports, visas and id-cards via OCR☆435Updated 9 months ago
- The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes…☆150Updated 6 years ago
- ☆147Updated 5 years ago
- BFSI sectors deal with lots of unstructured scanned documents which are archived in document management systems for further use.For examp…☆42Updated 4 years ago
- Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)☆198Updated 3 years ago
- BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on sc…☆114Updated 2 years ago
- Library used to deskew a scanned document☆493Updated last week
- Tesseract 4 traineddata for recognizing Seven Segment Display☆58Updated 6 years ago
- my personal receipts collected all over the world☆81Updated last year
- Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Docum…☆326Updated 2 years ago
- Demos, examples and utilities using PyMuPDF☆690Updated last year
- Extract tables from images or PDFs and convert them to Excel files☆127Updated 3 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition.☆276Updated 5 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆404Updated last year
- a machine learning implementation of OCR☆98Updated 2 years ago
- Python interface to Apache PDFBox command-line tools.☆78Updated 2 years ago
- Code and procdures for handwriting object detection and recognition☆81Updated 5 years ago
- Building OCR using YOLO and Tesseract☆96Updated 4 years ago
- Document Layout Analysis☆392Updated this week
- A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.☆456Updated 2 years ago
- A package for signature detection☆56Updated 3 years ago
- CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)☆157Updated 3 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆110Updated 2 years ago
- NanoNets OCR API Example for Python☆207Updated 3 years ago
- A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents usi…☆503Updated 2 years ago