In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…
☆28Mar 8, 2020Updated 6 years ago
Alternatives and similar repositories for ConvolutionaNeuralNetworksToEnhanceCodedSpeech
Users that are interested in ConvolutionaNeuralNetworksToEnhanceCodedSpeech are comparing it to the libraries listed below
Sorting:
- Keras implementation of speech enhancement based on LSGAN☆20Dec 10, 2017Updated 8 years ago
- Convolutional neural nets for single channel speech enhancement☆144Dec 15, 2020Updated 5 years ago
- Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)☆92Jul 22, 2019Updated 6 years ago
- Components loss for neural networks in mask-based speech enhancement☆33Nov 20, 2020Updated 5 years ago
- (tensorflow) Wiener Filter based Speech Enhancement(LSTM/BLSTM, GRU/BGRU, Transformer)☆15Dec 3, 2019Updated 6 years ago
- ☆27Apr 12, 2018Updated 7 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Oct 10, 2019Updated 6 years ago
- A neural network consist of cnn and lstm for speech enhancement☆25Aug 2, 2018Updated 7 years ago
- Audio Signal Processing Python Tools☆50Jun 15, 2017Updated 8 years ago
- An Attention-based Neural Network Approach for Single Channel Speech Enhancement☆25Dec 1, 2019Updated 6 years ago
- DDAE speech enhancement on spectrogram domain using Keras☆25Aug 21, 2017Updated 8 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆38Mar 24, 2023Updated 2 years ago
- Real-time Python framework for DNN-based speech enhancement☆27Mar 23, 2020Updated 5 years ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆17Aug 31, 2017Updated 8 years ago
- ☆55Jul 21, 2019Updated 6 years ago
- Language modelling for sound event detection☆20Jan 2, 2020Updated 6 years ago
- A Conditional Generative Adverserial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The bas…☆45Mar 17, 2018Updated 8 years ago
- Speech Enhancement using Bayesian WaveNet☆98Apr 1, 2018Updated 7 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆62Sep 24, 2021Updated 4 years ago
- Task 4 Large-scale weakly supervised sound event detection for smart cars☆68Dec 20, 2021Updated 4 years ago
- deep learning based speech enhancement using keras or pytorch, make it easy to use☆339Feb 26, 2020Updated 6 years ago
- Python functions to convert between different speech quality metrics☆54Apr 25, 2018Updated 7 years ago
- Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"☆12Nov 25, 2021Updated 4 years ago
- An implementation of a sound dereverberation algorithm by Gilbert Soulodre☆23Apr 26, 2018Updated 7 years ago
- Weighted RLS based adaptive dereverberation algorithm☆28Nov 25, 2019Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- A perceptual weighting filter loss for DNN training in speech enhancement☆24Apr 30, 2022Updated 3 years ago
- Multiple Fundamental Frequency Estimation☆27Apr 7, 2014Updated 11 years ago
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- Source data, scripts and makefiles of the experiment for the Speex codec quality evaluation☆22Aug 29, 2011Updated 14 years ago
- speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN☆36Apr 16, 2018Updated 7 years ago
- An implementation of the Prism layer (https://arxiv.org/abs/2011.04823)☆12Nov 13, 2020Updated 5 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Nov 12, 2020Updated 5 years ago
- ☆10May 22, 2023Updated 2 years ago
- ☆20Apr 11, 2019Updated 6 years ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆44Oct 14, 2021Updated 4 years ago