kamperh / speech_dtw
Dynamic time warping (DTW) functions specifically for speech alignment.
☆27 Updated 6 months ago
Related projects
Alternatives and complementary repositories for speech_dtw
- scripts to align a given wave to its transcription using trained models by Kaldi ☆32 Updated 5 years ago
- ☆22 Updated 7 years ago
- Hybrid speech synthesiser ☆28 Updated 5 years ago
- ☆40 Updated 2 years ago
- ☆48 Updated 3 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai… ☆54 Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ☆26 Updated 3 months ago
- 2018/2019 TTS framework integrating state of the art open source methods ☆47 Updated 5 years ago
- Experiments on speech recognition robustness to accents and dialects ☆12 Updated 5 years ago
- Voxceleb1 i-vector based speaker recognition system ☆43 Updated 6 years ago
- Unsupervised word segmentation and clustering of speech ☆13 Updated 7 years ago
- ABX and kaldi experiments on speech corpora made easy ☆31 Updated last month
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 Updated 2 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning. ☆23 Updated 6 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60 Updated last year
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆14 Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.… ☆11 Updated 4 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation ☆19 Updated 6 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files ☆47 Updated 4 months ago
- Custom decoders for Kaldi ☆80 Updated 5 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2. ☆13 Updated 8 years ago
- Language identification using Siamese network based on i-vector ☆7 Updated 7 years ago
- An implementation of RNN-Transducer loss in TF-2.0. ☆45 Updated last year
- Kaldi extended by Kaituo XU with new features in nnet1. ☆12 Updated 5 years ago
- readers that enable reading kaldi ark in tensorflow ☆17 Updated 6 years ago
- Pronunciation-assisted Subword Modeling ☆29 Updated 5 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆14 Updated 5 years ago
- Recurrent Neural Aligner ☆49 Updated 4 years ago
- it's ASR decoder and make graph project ☆32 Updated 2 years ago