wilsonchingg / logmmse
LogMMSE speech enhancement/noise reduction
☆30 Updated 4 years ago
Related projects
Alternatives and complementary repositories for logmmse
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech" ☆64 Updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆87 Updated 4 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN) ☆57 Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆100 Updated last year
- Tensor2tensor experiment with SpecAugment ☆47 Updated 5 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems ☆19 Updated 3 years ago
- Data processing tools for preparing speech and labels for training TTS voices ☆24 Updated 4 years ago
- Feature extractor for DL speech processing. ☆65 Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60 Updated last year
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020) ☆79 Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 ☆57 Updated 2 years ago
- Interspeech 2019 tutorial materials ☆48 Updated 5 years ago
- Tensorflow Implementation of WaveGlow ☆37 Updated 4 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106 ☆46 Updated 3 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020) ☆50 Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech" ☆51 Updated 4 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone … ☆41 Updated 2 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Waven… ☆52 Updated 5 years ago
- Follows NVIDIA, simplifies it, and supports data parallelism. ☆13 Updated 5 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat… ☆64 Updated 3 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021 ☆39 Updated 3 years ago
- A lightweight library to compute Diarization Error Rate (DER). ☆59 Updated last year
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an… ☆46 Updated 6 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean ☆23 Updated 6 years ago
- A tutorial implementation of voice conversion using PyTorch ☆35 Updated 6 years ago
- End-to-End Speech Recognition Using Tensorflow ☆41 Updated last year
- ☆28 Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec… ☆43 Updated last year
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech ☆51 Updated 3 years ago
- ☆40 Updated 2 years ago