gullabi / STT-align
Coqui STT (🐸STT) based forced alignment tool
☆13 · Updated 3 years ago
Alternatives and similar repositories for STT-align
Users that are interested in STT-align are comparing it to the libraries listed below
- ☆12 · Updated 2 years ago
- Deepspeech ASR Model for the Catalan Language ☆17 · Updated 4 years ago
- Proposed splits for the LREC Wikipron paper ☆14 · Updated 5 years ago
- Simple Kaldi recipe for forced alignment ☆11 · Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆18 · Updated 2 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court ☆22 · Updated 2 years ago
- Phonetically-Oriented Word Error Rate ☆36 · Updated 6 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool ☆12 · Updated 4 years ago
- phone inventory library ☆16 · Updated 2 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 · Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆15 · Updated 5 years ago
- Pronounce Arabic words ☆19 · Updated 6 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi. ☆18 · Updated 3 years ago
- ☆10 · Updated 4 years ago
- A handy dataset of noises for ASR ☆22 · Updated 6 years ago
- Long audio alignment using Kaldi ☆23 · Updated 4 years ago
- Transfer learning approach to pronunciation scoring ☆10 · Updated last year
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA) ☆10 · Updated 4 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆43 · Updated 2 years ago
- Forced Alignments for Common Voice ☆31 · Updated 4 years ago
- Expected edit distance implementation using OpenFst tools ☆11 · Updated 10 years ago
- Dynamic time warping (DTW) functions for specifically speech alignment. ☆28 · Updated last year
- Easier analysis of large speech corpora ☆23 · Updated 4 years ago
- ☆11 · Updated 2 weeks ago
- Multilingual Grapheme to Phoneme ☆50 · Updated 9 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning ☆40 · Updated 3 years ago
- ☆22 · Updated 3 years ago
- pronunciation dictionaries for multiple languages ☆90 · Updated 8 years ago
- BurrMill core ☆21 · Updated 3 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean ☆23 · Updated 7 years ago