dshea89 / tesseract-retraining-pipeline
Intuitive interface for fine-tuning and retraining a Tesseract OCR language model
☆9Updated 3 months ago
Alternatives and similar repositories for tesseract-retraining-pipeline
Users who are interested in tesseract-retraining-pipeline are comparing it to the repositories listed below
- A TensorFlow implementation of a hybrid CNN-LSTM model with CTC loss for the OCR problem☆32Updated 6 years ago
- ☆17Updated 4 years ago
- Handwritten text recognition with sequence-to-sequence architecture☆17Updated 2 years ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆16Updated last month
- PyTorch implementation of SuperTML: Two-Dimensional Word Embedding for the Precognition on Structured Tabular Data paper☆25Updated 9 months ago
- Official repository accompanying the ICDAR 2023 paper☆12Updated last year
- Generating Training Data Made Easy☆43Updated 4 years ago
- Code and data for the paper "One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction"☆61Updated 4 years ago
- Keras implementation of character-level sequence-to-sequence learning for spelling correction☆74Updated 6 years ago
- Embedding Visualizer (EmbedViz) data app made with Streamlit library☆22Updated 4 years ago
- Implementation of BertGrid : https://arxiv.org/abs/1909.04948☆30Updated last year
- ☆17Updated 4 years ago
- handwritten text recognition on IAM handwriting dataset☆15Updated 5 years ago
- fastai v3 part 2, fastai v2, and PyTorch v1 Course in Vienna v2☆20Updated 5 years ago
- ☆14Updated 3 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Updated 3 years ago
- Detect textlines in document images☆93Updated 11 months ago
- Neural Search System on Arxiv AI/ML Papers☆54Updated 3 years ago
- Example from my "Serverless Deep Learning" talk☆22Updated 4 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- Handwritten Number Recognition using CNN and Character Segmentation☆18Updated 7 years ago
- In an effort to decrease the execution time of the OCR process, a multi-processing script was created using Python's multi-processing mod…☆10Updated 5 years ago
- Model-Logger is a Python library for storing model's profile and rapid inter model comparison.☆61Updated 2 years ago
- my personal receipts collected all over the world☆73Updated 7 months ago
- Neural Deconvolutions in Tensorflow☆12Updated 5 years ago
- sambalshikhar / Document-Image-Classification-with-Intra-Domain-Transfer-Learning-and-Stacked-Generalization-of-DeepRVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the …☆18Updated 5 years ago
- ☆13Updated 7 years ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆31Updated last year
- An end to end Deep Learning Solution for table detection and structure recognition☆12Updated 4 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Updated 6 years ago