drethage / speech-denoising-wavenetLinks
A neural network for end-to-end speech denoising
☆702Updated 2 years ago
Alternatives and similar repositories for speech-denoising-wavenet
Users that are interested in speech-denoising-wavenet are comparing it to the libraries listed below
Sorting:
- Speech Enhancement Generative Adversarial Network in TensorFlow☆847Updated 2 years ago
- Speech Enhancement Generative Adversarial Network in PyTorch☆404Updated 2 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆518Updated 3 years ago
- Deep learning for audio denoising☆737Updated 2 years ago
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.☆309Updated 3 years ago
- Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"☆369Updated 4 years ago
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆447Updated last year
- deep learning based speech enhancement using keras or pytorch, make it easy to use☆339Updated 5 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆592Updated 3 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆451Updated 5 years ago
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆733Updated 2 years ago
- Speech Denoising with Deep Feature Losses☆189Updated 5 years ago
- Voice Activity Detector in Python☆478Updated 4 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆864Updated 4 years ago
- A Python wrapper for the high-quality vocoder "World"☆768Updated 9 months ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆649Updated 5 years ago
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)☆517Updated 5 years ago
- Unofficial PyTorch implementation of Google AI's VoiceFilter system☆1,166Updated last year
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆936Updated last year
- Real-time GCC-NMF Blind Speech Separation and Enhancement☆324Updated 6 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆366Updated 6 years ago
- Voice Converter Using CycleGAN and Non-Parallel Data☆531Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆473Updated 5 years ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,202Updated 4 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,015Updated 2 years ago
- The Implementation of FastSpeech based on pytorch.☆877Updated 2 years ago
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆559Updated last year
- Implementation of the Wave-U-Net for audio source separation☆914Updated 2 years ago
- The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.☆312Updated 3 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Updated 2 years ago