dshea89 / tesseract-retraining-pipeline
Intuitive interface for fine-tuning and retraining a Tesseract OCR language model
☆10Updated 5 months ago
Alternatives and similar repositories for tesseract-retraining-pipeline
Users who are interested in tesseract-retraining-pipeline are comparing it to the libraries listed below.
- ☆17Updated 4 years ago
- Finetune LayoutLM on SROIE dataset using W&B tools☆19Updated 4 years ago
- An end to end Deep Learning Solution for table detection and structure recognition☆12Updated 4 years ago
- Official repository accompanying the ICDAR 2023 paper☆12Updated 2 years ago
- A TensorFlow implementation of a hybrid CNN-LSTM model with CTC loss for the OCR problem☆33Updated 6 years ago
- ☆10Updated 7 years ago
- Code and data for the paper "One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction"☆61Updated 5 years ago
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Updated 4 years ago
- Generative Adversarial Network (GAN) that generates logo images.☆38Updated 6 years ago
- Handwritten text recognition with sequence-to-sequence architecture☆17Updated 2 years ago
- Keras implementation of character-level sequence-to-sequence learning for spelling correction☆73Updated 6 years ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆31Updated 2 years ago
- Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks☆43Updated 6 years ago
- Google Colab Demo of CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents☆47Updated 4 years ago
- Document Classification and Post-OCR Key Value Extraction☆63Updated 6 years ago
- A series of notebooks demonstrating how to build simple NLP web apps with Gradio and Hugging Face transformers☆44Updated 2 months ago
- In an effort to decrease the execution time of the OCR process, a multi-processing script was created using Python's multi-processing mod…☆10Updated 6 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Updated last year
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆38Updated 2 years ago
- PyTorch implementation of SuperTML: Two-Dimensional Word Embedding for the Precognition on Structured Tabular Data paper☆25Updated last year
- This repo contains code to convert Structured Documents to Graphs and implement a Graph Convolution Neural Network for node classificatio…☆146Updated 3 years ago
- TensorFlow implementation of ScrabbleGAN (ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation, CVPR 2020)☆63Updated 3 years ago
- Neural Search System on Arxiv AI/ML Papers☆54Updated 4 years ago
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images In modern times, more a…☆62Updated 3 years ago
- Python OCR using Tesseract with the EAST OpenCV detector☆42Updated last year
- Applying progressive resizing to building models in Keras.☆18Updated 6 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Updated 4 years ago
- ☆20Updated 3 years ago
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"☆25Updated 2 years ago
- The largest VQA dataset for Vietnamese, focused on the text content in images.☆21Updated 8 months ago