linksense / ConvolutionaNeuralNetworksToEnhanceCodedSpeechLinks
In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…
☆28Updated 5 years ago
Alternatives and similar repositories for ConvolutionaNeuralNetworksToEnhanceCodedSpeech
Users that are interested in ConvolutionaNeuralNetworksToEnhanceCodedSpeech are comparing it to the libraries listed below
Sorting:
- ☆20Updated 4 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆55Updated 2 years ago
- ☆54Updated 6 years ago
- Distributed semi-constrained microphone arrays☆29Updated last year
- Keras implementation of speech enhancement based on LSGAN☆20Updated 7 years ago
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆43Updated 5 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- An Experimental Study on Speech Enhancement based on DNN.☆14Updated 6 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆37Updated 2 years ago
- A neural network consist of cnn and lstm for speech enhancement☆24Updated 7 years ago
- This is a project on working/resolving the speech separation problem using supervised learning on various training targets, building mach…☆34Updated 8 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Updated 6 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 6 years ago
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆45Updated 4 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 5 years ago
- ☆20Updated 5 years ago
- A python implementation of Speech intelligibility in bits (SIIB)☆25Updated 3 years ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆39Updated 3 years ago
- Kaldi Speech Processing Tools☆25Updated 6 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Updated 7 years ago
- ☆38Updated 5 years ago
- speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN☆35Updated 7 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 5 years ago
- Components loss for neural networks in mask-based speech enhancement☆33Updated 4 years ago
- A pitch tracker inspired by David Talkin's RAPT (Robust Algorithm for Pitch Tracking) written in Python.☆48Updated 9 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆66Updated 6 years ago
- Code to reproduce the experiments in the paper "Fast and stable blind source separation with rank-1 updates" presented at ICASSP 2020.☆21Updated 5 years ago
- Neural Dereverberation☆35Updated 6 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- A PyTorch implementation of Conv-TasNet☆46Updated 5 years ago