drethage / speech-denoising-wavenet
A neural network for end-to-end speech denoising
☆692Updated last year
Alternatives and similar repositories for speech-denoising-wavenet
Users that are interested in speech-denoising-wavenet are comparing it to the libraries listed below
Sorting:
- Speech Enhancement Generative Adversarial Network in TensorFlow☆841Updated 2 years ago
- Speech Enhancement Generative Adversarial Network in PyTorch☆393Updated last year
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆441Updated 10 months ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆508Updated 3 years ago
- Audio Denoising with Deep Network Priors☆162Updated 4 years ago
- deep learning based speech enhancement using keras or pytorch, make it easy to use☆335Updated 5 years ago
- Deep learning based speech source separation using Pytorch☆316Updated 4 years ago
- A Python wrapper for the high-quality vocoder "World"☆749Updated 3 months ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆470Updated 5 years ago
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.☆310Updated 3 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆644Updated 4 years ago
- Deep learning for audio denoising☆706Updated last year
- Speech Denoising with Deep Feature Losses☆186Updated 4 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,006Updated last year
- Different implementations of "Weighted Prediction Error" for speech dereverberation☆519Updated last month
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆707Updated 2 years ago
- Voice Converter Using CycleGAN and Non-Parallel Data☆529Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆963Updated last year
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,606Updated last year
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆861Updated last year
- Python implementation of the Short Term Objective Intelligibility measure☆339Updated last year
- g2p: English Grapheme To Phoneme Conversion☆850Updated 2 years ago
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆763Updated 4 years ago
- Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"☆368Updated 3 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆219Updated 2 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- A must-read paper for speech separation based on neural networks☆779Updated 3 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆584Updated 3 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆367Updated 6 years ago
- PyTorch implementation of Tacotron speech synthesis model.☆310Updated 5 years ago