kamperh / segmentalist
Unsupervised word segmentation and clustering of speech
☆13Updated 8 years ago
Alternatives and similar repositories for segmentalist
Users that are interested in segmentalist are comparing it to the libraries listed below
- Dialect identification using Siamese network☆15Updated 7 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Updated last year
- Multilingual grapheme-to-phoneme conversion☆20Updated 7 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆14Updated 9 years ago
- Dynamic time warping (DTW) functions specifically for speech alignment.☆30Updated last year
- Readers that enable reading Kaldi ark in TensorFlow☆17Updated 7 years ago
- ☆22Updated 8 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- Zero-Resource Speech Discovery, Search, and Evaluation Tools☆29Updated 10 years ago
- All you need to get started for the Zero Speech Challenge 2017☆47Updated 6 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 6 months ago
- ☆45Updated 6 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 4 years ago
- Network specification and demo☆35Updated 8 years ago
- Bayesian spEEch Recognizer☆55Updated 4 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Updated 5 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Updated 7 years ago
- This repository contains the files used for our Interspeech 2017 paper.☆16Updated 8 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- An open-source tool for automatic speech recognition (ASR) quality estimation.☆23Updated 5 years ago
- Hybrid speech synthesiser☆28Updated 6 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Updated 10 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆23Updated 6 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Updated 3 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 6 years ago
- Unsupervised Speaker Clustering & Speaker Recognition☆13Updated 6 years ago
- Audio features extraction tool from wav to h5features format☆19Updated 6 years ago
- NMT-based punctuation prediction system using lexical and acoustic features.☆14Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 6 years ago