wilsonchingg / logmmse
LogMMSE speech enhancement/noise reduction
☆30Updated 4 years ago
Alternatives and similar repositories for logmmse:
Users that are interested in logmmse are comparing it to the libraries listed below
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated 2 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆58Updated 2 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Updated 4 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- Text-to-Speech tutorial at SLTU 2016☆35Updated 8 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆34Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- Python library for audio augmentation☆83Updated last year
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- LogMMSE speech enhancement/noise reduction☆88Updated 5 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆25Updated 4 years ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- ☆40Updated 3 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆61Updated 4 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorch☆87Updated 8 months ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- Generative Adversarial Networks for different impaired speech conversions☆36Updated last year
- Feature extractor for DL speech processing.☆65Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆30Updated 5 years ago