ricktorzynski / ocr-tesseract-dockerLinks
OCR using Python, Tesseract and OpenCV in a Docker container
☆125Updated 2 years ago
Alternatives and similar repositories for ocr-tesseract-docker
Users that are interested in ocr-tesseract-docker are comparing it to the libraries listed below
Sorting:
- Dockerized example to train Tesseract v. 4☆64Updated 2 years ago
- A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV☆124Updated 3 years ago
- Python library to extract tabular data from images and scanned PDFs☆281Updated last year
- Train Tesseract LSTM with make☆690Updated 4 months ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.☆521Updated 4 years ago
- ☆254Updated last year
- Working example for serving a ML model using FastAPI and Celery☆75Updated 3 years ago
- Tesseract 4 OCR Runtime Environment - Docker Container☆101Updated 6 years ago
- Extraction of machine-readable zone information from passports, visas and id-cards via OCR☆417Updated 5 months ago
- A docker-compose file for launching Jupyter Notebooks in a container.☆156Updated 7 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition.☆275Updated 5 years ago
- Detecting the National Identification Cards with Deep Learning (Faster R-CNN)☆307Updated 2 years ago
- Train Spacy ner with custom dataset☆182Updated 2 years ago
- BFSI sectors deal with lots of unstructured scanned documents which are archived in document management systems for further use.For examp…☆41Updated 4 years ago
- Run python script as a cron job using Docker☆98Updated 2 years ago
- ☆146Updated 5 years ago
- Python interface to Apache PDFBox command-line tools.☆77Updated 2 years ago
- ☆36Updated last year
- NanoNets OCR API Example for Python☆201Updated 3 years ago
- The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes…☆147Updated 6 years ago
- Extract tables from images or PDFs and convert them to Excel files☆125Updated 2 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆395Updated last year
- A Python tool to help extracting information from structured PDFs.☆412Updated 2 weeks ago
- Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)☆198Updated 2 years ago
- A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents usi…☆495Updated 2 years ago
- a machine learning implementation of OCR☆98Updated 2 years ago
- ☆376Updated last year
- Building OCR using YOLO and Tesseract☆96Updated 3 years ago
- Library used to deskew a scanned document☆478Updated this week
- Parsing pdf tables using YOLOV3☆118Updated 4 years ago