rafayk7 / tesseractDataGenerator
Data Generator for Training Tesseract OCR
☆11Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for tesseractDataGenerator
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Updated 5 years ago
- Wrapper for Image functions which are called like in the PIL module but work internally with OpenCV☆25Updated last year
- Build your own document scanner with OpenCV Python☆116Updated 4 years ago
- Open Collaborative AI Driven Parser builder for Web Scraping, Data Extraction and Crawling,Knowledge Graph☆17Updated 5 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- A tool to compare two directories and show diff in HTML☆23Updated 11 months ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆44Updated 7 months ago
- Deeplearing based Reverse Image Search using Annoy library☆16Updated 5 years ago
- Transcribe audio to text with various Speech to Text Tools☆17Updated 4 years ago
- resumerise: classify and summarizes resumes☆34Updated 3 years ago
- Post-processing OCR errors with seq2seq models☆28Updated 4 years ago
- Easy formatted text extraction from images using Google Vision API☆41Updated 3 years ago
- Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents☆12Updated 2 years ago
- A tool that is built using several open source services and uses Open AI's GPT-2 as a base model.☆4Updated last year
- • In this project, you will learn how to extract email and phone number from a business card or any document and save the output in a JSO…☆25Updated 5 years ago
- Content-Based Image Retrieval system (KTH DD2476 Project)☆9Updated 7 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 6 years ago
- A web application to process receipt images by Deep learning based OCR☆12Updated 3 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 3 months ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆37Updated 8 months ago
- ☆20Updated 5 years ago
- Python ffmpeg wrapper for audio and video editing (trim, subtitles/overlay, concat, merge, & more!)☆23Updated 5 years ago
- Segmenting Handwritten Paragraphs into Characters☆49Updated 5 years ago
- DFKI Layout Detection for OCR-D☆47Updated 2 weeks ago
- Python tools for Tesseract OCR training☆25Updated 2 years ago
- NLP course-based project: focus on translation of Chinese to English and Vietnamese to English.☆8Updated 5 years ago
- python ocr using tesseract/ with EAST opencv detector☆42Updated 4 months ago
- DeepLearningを利用して簡単に花の絵を描くツール☆18Updated 6 years ago
- Meaningful Optical Character Recognition from identity cards with Deep Learning.☆26Updated 3 years ago