gullabi / STT-alignLinks
Coqui STT (πΈSTT) based forced alignment tool
β13Updated 3 years ago
Alternatives and similar repositories for STT-align
Users that are interested in STT-align are comparing it to the libraries listed below
Sorting:
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWβ¦β18Updated 2 years ago
- Simple Kaldi recipe for forced alignmentβ10Updated last year
- A handy dataset of noises for ASRβ21Updated 6 years ago
- β17Updated 4 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Courtβ22Updated 2 years ago
- A collection of utilities for handling IPA phones.β25Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.β13Updated 4 years ago
- ARPABET transcription syllabifier moduleβ14Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.β15Updated 5 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-coreβ15Updated 2 years ago
- phone inventory libraryβ16Updated 2 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.β26Updated 11 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text toβ¦β45Updated 4 years ago
- Phonetically-Oriented Word Error Rateβ35Updated 6 years ago
- β17Updated 2 years ago
- Convert words to numbersβ20Updated 3 years ago
- β12Updated 2 years ago
- β11Updated 3 weeks ago
- β22Updated 3 years ago
- A Visualizer for prosodically annotated speech corporaβ12Updated 3 years ago
- Long audio alignment using Kaldiβ23Updated 4 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITIONβ42Updated 2 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.β16Updated 3 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)β9Updated 4 years ago
- Transfer learning approach to pronunciation scoringβ10Updated last year
- Deepspeech ASR Model for the Catalan Languageβ17Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any languageβ10Updated 4 years ago
- β40Updated 3 years ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectrβ¦β15Updated 2 years ago