nigelgward / midlevelLinks
Prosodic features for machine-learning applications, in Matlab.
☆15Updated 3 weeks ago
Alternatives and similar repositories for midlevel
Users that are interested in midlevel are comparing it to the libraries listed below
Sorting:
- Phonetically-Oriented Word Error Rate☆36Updated 6 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Updated last year
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- ☆22Updated 8 years ago
- Simple Kaldi recipe for forced alignment☆11Updated 2 years ago
- All you need to get started for the Zero Speech Challenge 2017☆47Updated 6 years ago
- Dynamic time warping (DTW) functions for specifically speech alignment.☆29Updated last year
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Updated 6 years ago
- ☆16Updated 5 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Updated last year
- Multilingual grapheme-to-phoneme conversion☆20Updated 7 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 5 years ago
- ☆27Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 4 years ago
- Word Error Rate Estimation☆15Updated 5 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 4 years ago
- Fork of the official kaldi.☆22Updated 3 years ago
- Zero-Resource Speech Discovery, Search, and Evaluation Tools☆29Updated 10 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
- Feature extraction for accented-speech or pathological speech☆17Updated 6 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 6 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated 2 years ago
- Long audio alignment using Kaldi☆23Updated 4 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 6 months ago
- ☆13Updated 2 years ago
- ☆45Updated 6 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- ☆12Updated 7 years ago