kamperh / speech_dtwLinks
Dynamic time warping (DTW) functions for specifically speech alignment.
☆30Updated last year
Alternatives and similar repositories for speech_dtw
Users that are interested in speech_dtw are comparing it to the libraries listed below
Sorting:
- ☆22Updated 8 years ago
- Hybrid speech synthesiser☆28Updated 6 years ago
- Feature extraction for accented-speech or pathological speech☆17Updated 6 years ago
- Easier analysis of large speech corpora☆23Updated 4 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- Simple Kaldi recipe for forced alignment☆11Updated 2 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 6 years ago
- Long audio alignment using Kaldi☆23Updated 4 years ago
- ABX discrimination task in python☆45Updated last year
- ABX and kaldi experiments on speech corpora made easy☆33Updated last year
- Pulse Model vocoder☆42Updated 6 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- ☆16Updated 6 years ago
- ☆27Updated 4 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆34Updated 6 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- Interspeech 2019 tutorial materials☆49Updated 6 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- ☆40Updated 3 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 3 years ago
- Zero-Resource Speech Discovery, Search, and Evaluation Tools☆29Updated 10 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 8 years ago
- ☆17Updated 5 years ago
- Util code, issues, discussions☆29Updated 7 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 6 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated last year
- Phonetically-Oriented Word Error Rate☆36Updated 6 years ago
- A fast cnn-based vocoder☆78Updated 5 years ago
- Attacking Speaker Recognition with Deep Generative Models☆34Updated 2 years ago