rafayk7 / tesseractDataGenerator
Data Generator for Training Tesseract OCR
☆11Updated 4 years ago
Alternatives and similar repositories for tesseractDataGenerator:
Users who are interested in tesseractDataGenerator are comparing it to the libraries listed below:
- A tool that is built using several open source services and uses OpenAI's GPT-2 as a base model.☆4Updated 2 years ago
- Question generation from text☆15Updated 12 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Updated 6 years ago
- Document Image Classification☆11Updated 7 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated last week
- Post-processing OCR errors with seq2seq models☆28Updated 4 years ago
- Wrapper for Image functions which are called like in the PIL module but work internally with OpenCV☆26Updated 2 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- Image Pre-processing to improve OCR accuracy.☆20Updated 8 years ago
- Python OCR using Tesseract with the EAST OpenCV detector☆42Updated 9 months ago
- A simple document scanner with OCR implemented using Python and OpenCV☆44Updated 4 years ago
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi…☆18Updated last week
- Deepiracy - Video piracy detection by using neural networks and string algorithms.☆33Updated 6 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated last month
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆131Updated 2 years ago
- Next generation OCR engine based on LSTMs.☆52Updated 7 years ago
- A Reverse Image Search Engine☆30Updated 7 years ago
- A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models.☆10Updated 7 years ago
- Document Layout Analysis Projects☆23Updated 5 years ago
- Example of building a working Spanish-to-English translation model with Marian NMT☆22Updated 5 years ago
- Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents☆12Updated 3 years ago
- Deep learning-based Reverse Image Search using the Annoy library☆16Updated 6 years ago
- Extract structured data from PDF invoices☆13Updated 4 years ago
- This repo contains a collection of various mini projects.☆13Updated 6 years ago
- Reading barcodes in complex images☆29Updated 8 years ago
- Transcribe audio to text with various Speech to Text Tools☆17Updated 5 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Updated 11 years ago