gullabi / STT-align
Coqui STT (πΈSTT) based forced alignment tool
β13Updated 2 years ago
Related projects β
Alternatives and complementary repositories for STT-align
- β11Updated 2 years ago
- β77Updated 6 months ago
- β17Updated last year
- A collection of utilities for handling IPA phones.β25Updated last year
- Python wrapper for phonetisaurus grapheme to phoneme toolβ12Updated 3 years ago
- Simple Kaldi recipe for forced alignmentβ10Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWβ¦β15Updated last year
- Workflow for forced alignment between languagesβ17Updated 9 months ago
- β32Updated 2 months ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Courtβ22Updated last year
- Word Error Rate Estimationβ10Updated 4 years ago
- β40Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITIONβ37Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.β26Updated 3 months ago
- Prosodic Speech Segmentation with Transformersβ23Updated 8 months ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Support tools for punctuation and boundary detection for ASR output.β57Updated last year
- pronunciation dictionaries for multiple languagesβ83Updated 7 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.β18Updated 8 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text toβ¦β42Updated 3 years ago
- Long audio alignment using Kaldiβ25Updated 3 years ago
- Pronunciation-assisted Subword Modelingβ29Updated 5 years ago
- β20Updated 6 years ago
- scripts to align a given wave to its transcription using trained models by Kaldiβ32Updated 5 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Koreanβ23Updated 6 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis latticesβ16Updated 8 months ago
- Grapheme to phoneme model for PyTorchβ40Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"β17Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.β16Updated 3 weeks ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authorsβ39Updated 3 months ago