ricktorzynski / ocr-tesseract-dockerLinks
OCR using Python, Tesseract and OpenCV in a Docker container
☆125Updated 2 years ago
Alternatives and similar repositories for ocr-tesseract-docker
Users that are interested in ocr-tesseract-docker are comparing it to the libraries listed below
Sorting:
- Tesseract 4 OCR Runtime Environment - Docker Container☆101Updated 6 years ago
- Python library to extract tabular data from images and scanned PDFs☆283Updated last year
- A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV☆126Updated 3 years ago
- Dockerized example to train Tesseract v. 4☆64Updated 2 years ago
- Library used to deskew a scanned document☆491Updated this week
- The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes…☆149Updated 6 years ago
- Extraction of machine-readable zone information from passports, visas and id-cards via OCR☆430Updated 8 months ago
- Working example for serving a ML model using FastAPI and Celery☆74Updated 4 years ago
- BFSI sectors deal with lots of unstructured scanned documents which are archived in document management systems for further use.For examp…☆41Updated 4 years ago
- NanoNets OCR API Example for Python☆204Updated 3 years ago
- Extract tables from images or PDFs and convert them to Excel files☆126Updated 2 years ago
- Google Colab Demo of CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents☆47Updated 4 years ago
- ☆37Updated 2 years ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.☆521Updated 4 years ago
- Demos, examples and utilities using PyMuPDF☆686Updated last year
- ☆147Updated 5 years ago
- ☆110Updated 2 years ago
- Train Tesseract LSTM with make☆701Updated 6 months ago
- my personal receipts collected all over the world☆81Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆402Updated last year
- Train Spacy ner with custom dataset☆182Updated 3 years ago
- BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on sc…☆113Updated 2 years ago
- Template based form extractor OCR. Train your own character and alphabet OCR.☆18Updated 7 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition.☆275Updated 5 years ago
- ☆17Updated last year
- Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)☆198Updated 2 years ago
- A line-based framework to detect and extract tabular data in JSON format from raster images using computer vision and Tesseract OCR.☆57Updated last month
- Using Terran for creating video timelines☆134Updated 5 years ago
- Building OCR using YOLO and Tesseract☆96Updated 4 years ago
- A simple document detector in python3☆52Updated 2 years ago