rafayk7 / tesseractDataGeneratorLinks
Data Generator for Training Tesseract OCR
☆11Updated 4 years ago
Alternatives and similar repositories for tesseractDataGenerator
Users that are interested in tesseractDataGenerator are comparing it to the libraries listed below
Sorting:
- A tool that is built using several open source services and uses Open AI's GPT-2 as a base model.☆4Updated 2 years ago
- Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents☆12Updated 3 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated last month
- Document Image Classification☆11Updated 7 years ago
- Post-processing OCR errors with seq2seq models☆28Updated 4 years ago
- A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models.☆10Updated 7 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17Updated 10 years ago
- Deeplearing based Reverse Image Search using Annoy library☆16Updated 6 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- Wrapper around pixel classifier☆9Updated 3 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Updated 3 years ago
- Question generation from text☆15Updated 12 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated 2 months ago
- ☆20Updated 5 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Updated 6 years ago
- OCR evaluation brought to you by University of Alicante☆67Updated 2 years ago
- Uses Beautiful Soup to read Wiki pages, Gensim to summarize, NLTK to process, and extracts keywords based on entropy: everything in one b…☆9Updated 4 years ago
- Document Layout Analysis Projects☆23Updated 5 years ago
- Vocal, harmonic & percussive components are separated and percussive components are used for clasification.☆31Updated 3 years ago
- Python tools for Tesseract OCR training☆25Updated 3 years ago
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated 2 years ago
- work in progress - python Keras, Tensorflow, or Pytorch implementation of a chatbot or possibly smart-speaker☆18Updated last year
- Alphabot: a screen-less interactive spelling primer powered by computer vision☆14Updated 6 years ago
- Tool for sentiment analysis annotation☆12Updated 2 months ago
- 版面分析+OCR☆11Updated 3 years ago
- python ocr using tesseract/ with EAST opencv detector☆42Updated 10 months ago
- Reddit title generator API based on GPT-2☆19Updated 5 years ago
- Deep learning, Face detection, CNN, Tensorflow, Keras, OpenCV, Python crawler☆21Updated 7 years ago
- Next generation OCR engine based on LSTMs.☆52Updated 7 years ago