linksense / ConvolutionaNeuralNetworksToEnhanceCodedSpeech
In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…
☆28Updated 5 years ago
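The cepstral-domain idea mentioned in the description can be illustrated with a minimal sketch. It assumes a per-frame real-cepstrum analysis-synthesis loop in NumPy; the function names, the FFT size, and the identity placeholder standing in for a trained CNN are all hypothetical and are not taken from the repository.

```python
import numpy as np

def frame_to_cepstrum(frame, n_fft):
    """Analysis: return the real cepstrum and the phase of one frame."""
    spectrum = np.fft.rfft(frame, n=n_fft)
    # Small offset avoids log(0) for empty bins.
    log_mag = np.log(np.abs(spectrum) + 1e-12)
    # The real cepstrum is the inverse transform of the log-magnitude spectrum.
    cepstrum = np.fft.irfft(log_mag, n=n_fft)
    return cepstrum, np.angle(spectrum)

def cepstrum_to_frame(cepstrum, phase, n_fft):
    """Synthesis: rebuild the frame from a cepstrum and the stored phase."""
    log_mag = np.fft.rfft(cepstrum, n=n_fft).real
    spectrum = np.exp(log_mag) * np.exp(1j * phase)
    return np.fft.irfft(spectrum, n=n_fft)

def enhance_cepstrum(cepstrum):
    """Hypothetical placeholder for a trained CNN; here it is the identity."""
    return cepstrum

def postprocess_frame(frame, n_fft=512):
    """Decoded frame -> cepstrum -> (enhancement) -> frame, reusing the phase."""
    cepstrum, phase = frame_to_cepstrum(frame, n_fft)
    enhanced = enhance_cepstrum(cepstrum)
    return cepstrum_to_frame(enhanced, phase, n_fft)
```

With the identity placeholder, `postprocess_frame` returns its input up to numerical precision, which makes the sketch easy to check before a real model is swapped in. The codec itself is untouched, since the function only operates on already decoded frames.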
Alternatives and similar repositories for ConvolutionaNeuralNetworksToEnhanceCodedSpeech
Users who are interested in ConvolutionaNeuralNetworksToEnhanceCodedSpeech are comparing it to the libraries listed below.
- Implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆48Updated 5 years ago
- Keras implementation of speech enhancement based on LSGAN☆20Updated 8 years ago
- Neural Dereverberation☆36Updated 6 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆56Updated 2 years ago
- ☆54Updated 6 years ago
- An Experimental Study on Speech Enhancement based on DNN.☆14Updated 7 years ago
- ☆38Updated 5 years ago
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆48Updated 5 years ago
- A neural network consisting of a CNN and an LSTM for speech enhancement☆25Updated 7 years ago
- ☆20Updated 5 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 6 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆96Updated 5 years ago
- A perceptual weighting filter loss for DNN training in speech enhancement☆24Updated 3 years ago
- This is a project on working/resolving the speech separation problem using supervised learning on various training targets, building mach…☆34Updated 8 years ago
- Distributed semi-constrained microphone arrays☆31Updated last year
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆38Updated 2 years ago
- ☆21Updated 6 years ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆44Updated 4 years ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆34Updated 4 years ago
- A PyTorch implementation of Conv-TasNet☆46Updated 6 years ago
- speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN☆35Updated 7 years ago
- PyTorch implementation of "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification"☆19Updated 7 years ago
- ☆34Updated 6 years ago
- Components loss for neural networks in mask-based speech enhancement☆33Updated 5 years ago
- Speech enhancement using mimic loss☆16Updated 6 years ago
- Generate audio signals corresponding to moving sources/receivers in a shoebox-shaped room (MATLAB)☆39Updated 5 years ago
- Filter Banks, Fast Python Implementation☆42Updated 3 years ago
- Time-domain Audio Separation Network☆24Updated 7 years ago
- A pitch tracker inspired by David Talkin's RAPT (Robust Algorithm for Pitch Tracking) written in Python.☆48Updated 9 years ago
- Time-domain Audio Separation Network (IN PYTORCH)☆23Updated 7 years ago