This repository extends SEGAN, a GAN-based speech enhancement model, with two modifications that make model training more robust and stable.
☆38Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for improvedsegan
Users who are interested in improvedsegan are comparing it to the libraries listed below.
- ☆26Apr 21, 2021Updated 4 years ago
- Keras implementation of speech enhancement based on LSGAN☆20Dec 10, 2017Updated 8 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆97May 30, 2020Updated 5 years ago
- A neural network consisting of a CNN and an LSTM for speech enhancement☆25Aug 2, 2018Updated 7 years ago
- Remove noise from sound clips by use of supervised training and an ideal ratio mask.☆14Apr 2, 2019Updated 6 years ago
- Vocode spectrograms to audio with generative adversarial networks☆64Aug 8, 2019Updated 6 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Mar 8, 2020Updated 6 years ago
- ☆23Apr 25, 2022Updated 3 years ago
- Improved Speech Enhancement GANs☆12Jun 24, 2020Updated 5 years ago
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆34Aug 10, 2017Updated 8 years ago
- A PyTorch implementation of FB-MelGAN☆90May 26, 2020Updated 5 years ago
- Audio source separation (mixture to vocal) using the Wavenet☆21Sep 6, 2017Updated 8 years ago
- Speech Enhancement Generative Adversarial Network in TensorFlow☆859Mar 24, 2023Updated 3 years ago
- PyTorch implementation of FFTNet☆87Jun 20, 2018Updated 7 years ago
- Convolutional Neural Network for multitrack mix leveling☆18Jun 25, 2018Updated 7 years ago
- ☆18Nov 10, 2019Updated 6 years ago
- ☆10Sep 17, 2021Updated 4 years ago
- A fast cnn-based vocoder☆78Jun 11, 2020Updated 5 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples☆24Sep 16, 2020Updated 5 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- Fast parallel RNN-Transducer.☆10Nov 1, 2019Updated 6 years ago
- Quasi-Periodic Parallel WaveGAN Pytorch implementation☆46Oct 29, 2022Updated 3 years ago
- A PyTorch implementation of FFTNet.☆37Aug 31, 2018Updated 7 years ago
- Code and instructions for replicating the experiments in the paper: Unified Hypersphere Embedding for Speaker Recognition☆32Jul 14, 2019Updated 6 years ago
- Implementation of audio degradation processes☆105Nov 18, 2015Updated 10 years ago
- Speech Denoising using RNNs in Tensorflow☆25Apr 20, 2018Updated 7 years ago
- ☆27Apr 12, 2018Updated 7 years ago
- J-Net is aimed at audio separation with a randomly weighted encoder.☆12Oct 23, 2019Updated 6 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Dec 18, 2018Updated 7 years ago
- A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"☆155Oct 21, 2019Updated 6 years ago
- ☆13Jun 24, 2017Updated 8 years ago
- Speech waveform synthesis filters☆13Jul 21, 2017Updated 8 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆523Feb 17, 2022Updated 4 years ago
- A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆94Jul 17, 2018Updated 7 years ago
- SEGAN PyTorch implementation https://arxiv.org/abs/1703.09452☆111Mar 11, 2019Updated 7 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Oct 10, 2019Updated 6 years ago
- A toy-like Text-to-Speech system for Chinese/Mandarin synthesis, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15May 25, 2022Updated 3 years ago