kamperh / eskmeans
Embedded segmental K-means (ES-KMeans) in Python.
☆14Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for eskmeans
- CMU multilingual speech repository☆31Updated 2 years ago
- Vector Quantized Autoregressive Predictive Coding (VQ-APC)☆35Updated 4 years ago
- ☆22Updated 7 years ago
- ☆42Updated 6 years ago
- NIST SPH File reader (e.g. for TEDLIUM Corpus)☆25Updated 4 years ago
- All you need to get started for the Zero Speech Challenge 2017☆46Updated 5 years ago
- Stellenbosch University ZeroSpeech 2019 System☆10Updated 5 years ago
- ☆26Updated 3 years ago
- ☆36Updated 3 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated last year
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Updated last week
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆20Updated 3 years ago
- ☆32Updated 3 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- ☆39Updated 4 years ago
- ☆34Updated 5 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆38Updated 4 years ago
- Gaussian Mixture VAE Tacotron☆53Updated last year
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and…☆52Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- Addressing the confounds of accompaniments in singer identification☆18Updated 4 years ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆31Updated 4 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- working on parallel wavenet☆25Updated 6 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆38Updated 3 years ago
- A python implementation of the Griffin Lim Algorithm for audio reconstruction from magnitudes☆32Updated 10 months ago
- Raw waveform adaptation with SincNet☆11Updated 8 months ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago