wilsonchingg / logmmseLinks
LogMMSE speech enhancement/noise reduction
☆30Updated 5 years ago
Alternatives and similar repositories for logmmse
Users that are interested in logmmse are comparing it to the libraries listed below
Sorting:
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated 2 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- Bayesian spEEch Recognizer☆55Updated 4 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Updated 4 years ago
- A fast cnn-based vocoder☆78Updated 5 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Updated 10 years ago
- End to End Dialect Identification using Convolutional Neural Network☆52Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- Code for AccentDB.☆22Updated 4 years ago
- Real-Time High-Fidelity Speech Synthesis without GPU☆73Updated 11 months ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- ☆56Updated 6 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago
- ☆31Updated 6 years ago
- Util code, issues, discussions☆29Updated 6 years ago
- Pytorch Implementation of FFTNet☆86Updated 7 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 2 years ago
- Wavenet and its applications with Tensorflow☆55Updated 6 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- An implementation of Tacotron and Tacotron2☆81Updated 3 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Updated 4 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆34Updated 7 years ago
- ☆57Updated 3 years ago