drethage / speech-denoising-wavenet
A neural network for end-to-end speech denoising
☆673Updated last year
Related projects: ⓘ
- Speech Enhancement Generative Adversarial Network in TensorFlow☆809Updated last year
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.☆306Updated 2 years ago
- Speech Denoising with Deep Feature Losses☆183Updated 4 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆497Updated 2 years ago
- Deep learning for audio denoising☆644Updated 11 months ago
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆437Updated 2 months ago
- Speech Enhancement Generative Adversarial Network in PyTorch☆376Updated last year
- deep learning based speech enhancement using keras or pytorch, make it easy to use☆333Updated 4 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆835Updated 3 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆633Updated 3 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆959Updated last year
- Audio Denoising with Deep Network Priors☆163Updated 3 years ago
- Implementation of the Wave-U-Net for audio source separation☆824Updated last year
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆667Updated last year
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)☆515Updated 3 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆887Updated last year
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,541Updated 4 months ago
- Real-time GCC-NMF Blind Speech Separation and Enhancement☆314Updated 5 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆457Updated 4 years ago
- Python implementation of the Short Term Objective Intelligibility measure☆319Updated 8 months ago
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆705Updated 3 years ago
- A Python wrapper for the high-quality vocoder "World"☆718Updated 10 months ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆576Updated 2 years ago
- Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.☆567Updated last year
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,124Updated 3 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆368Updated 5 years ago
- Efficient neural speech synthesis☆1,123Updated last year
- An STFT/iSTFT for PyTorch.☆342Updated 10 months ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆853Updated last year
- A flexible source separation library in Python☆604Updated last year