artdevgame / tesseract-trainerLinks
Containerised version of tesseract v4 tools required for training a new font
☆13Updated 4 years ago
Alternatives and similar repositories for tesseract-trainer
Users that are interested in tesseract-trainer are comparing it to the libraries listed below
Sorting:
- ☆11Updated 4 years ago
- ☆14Updated 7 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Updated 2 years ago
- ☆49Updated 2 years ago
- Transformation spoken text to written text☆31Updated last year
- A python-based algorithm for id-card rectification☆52Updated last year
- A line-based framework to detect and extract tabular data in JSON format from raster images using computer vision and Tesseract OCR.☆59Updated 4 months ago
- Code-Switched translations with Large Language models☆24Updated last year
- ☆66Updated 2 years ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆23Updated last year
- A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition☆195Updated 3 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆154Updated 8 months ago
- ☆40Updated 4 years ago
- Finetuning Whisper ASR model for Belarusian language☆17Updated 11 months ago
- Object Detection Model for Scanned Documents☆94Updated 11 months ago
- Key information extraction from invoice document with Graph Convolution Network☆55Updated 2 years ago
- The task aims at extracting required fields in receipts captured by mobile devices☆34Updated 3 years ago
- ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for us…☆62Updated 10 months ago
- finetune llm part for spark-tts model☆120Updated 10 months ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆66Updated last year
- A synthesized dataset for Vietnamese TTS task☆66Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Updated 4 years ago
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆57Updated 2 years ago
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14Updated 3 years ago
- A python package for deep multilingual punctuation prediction.☆156Updated last year
- EraX Text to Speech base on F5-TTS Base V1☆79Updated 9 months ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆241Updated last year
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆98Updated 8 months ago
- Tool for enhancing noisy scanned text images☆48Updated 6 years ago
- ☆63Updated 7 months ago