artdevgame / tesseract-trainerLinks
Containerised version of tesseract v4 tools required for training a new font
☆13Updated 3 years ago
Alternatives and similar repositories for tesseract-trainer
Users that are interested in tesseract-trainer are comparing it to the libraries listed below
Sorting:
- ☆38Updated 3 years ago
- ☆12Updated 3 years ago
- ☆11Updated last week
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription☆48Updated 2 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆35Updated last month
- ☆14Updated 6 years ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆58Updated 7 months ago
- finetune llm part for spark-tts model☆99Updated 3 months ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆55Updated 9 months ago
- ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for us…☆59Updated 4 months ago
- ☆157Updated 7 months ago
- A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition☆180Updated last week
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆22Updated 11 months ago
- Transformation spoken text to written text☆30Updated last year
- Finetuning Whisper ASR model for Belarusian language☆17Updated 5 months ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated 2 years ago
- TTS models for Arabic (Tacotron2, FastPitch)☆119Updated 8 months ago
- TTS for Arabic (FastPitch, Mixer-TTS) in the ONNX format☆25Updated 3 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆82Updated 8 months ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆12Updated 11 months ago
- VietTTS: An Open-Source Vietnamese Text to Speech☆58Updated 7 months ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆11Updated 11 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- ☆47Updated 2 years ago
- A synthesized dataset for Vietnamese TTS task☆63Updated 3 years ago
- Use quantized versions of Whisper to speed up inference☆12Updated 9 months ago
- Document image dewarping library using a cubic sheet model☆165Updated this week
- PAFTS : Library That Preprocessing Audio For TTS.☆21Updated 8 months ago