stephengrice / synth-me
Basic concatenative text-to-speech implementation in Python
☆18Updated 5 years ago
Alternatives and similar repositories for synth-me:
Users that are interested in synth-me are comparing it to the libraries listed below
- Phoneme prediction from speech mel-spectrograms using RNN.☆13Updated 5 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 3 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆24Updated 4 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- ABX and kaldi experiments on speech corpora made easy☆31Updated 5 months ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 4 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆25Updated 2 years ago
- Official PyTorch implementation of TTS Style Transfer☆23Updated 2 years ago
- Singing voice detection☆16Updated 6 years ago
- Unsupervised word segmentation and clustering of speech☆13Updated 8 years ago
- A simple voice conversion tool☆17Updated 3 years ago
- ☆11Updated 3 years ago
- Util code, issues, discussions☆28Updated 6 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Updated 5 years ago
- Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information☆27Updated 7 years ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"☆11Updated 5 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆26Updated last year
- ☆24Updated 6 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- Core code for my ICASSP 2018 paper☆53Updated 6 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Updated 6 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 6 years ago
- Code for AccentDB.☆20Updated 3 years ago
- PyTorch code to separate instruments from music using a low-latency neural network☆44Updated 5 years ago
- single channel speech separation for music vocal and accompany separate、voice reduce noise☆13Updated 5 years ago