b04901014 / ISGAN
☆21Updated 6 years ago
Alternatives and similar repositories for ISGAN:
Users that are interested in ISGAN are comparing it to the libraries listed below
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- A pytroch implementation of the FB-MelGAN☆88Updated 4 years ago
- ☆64Updated last year
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- ☆22Updated 6 years ago
- Components loss for neural networks in mask-based speech enhancement☆33Updated 4 years ago
- An implementation of SkipVQVC with various settings.☆75Updated 4 years ago
- Voice conversion (VC) investigation using three variants of VAE☆56Updated 5 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆68Updated 3 years ago
- Pytorch implementation of subband decomposition☆91Updated 2 years ago
- Yin pitch estimator in PyTorch☆115Updated 2 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Updated 3 years ago
- ☆50Updated 3 years ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)☆57Updated 2 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆37Updated last year
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆23Updated 2 years ago
- ☆47Updated 4 years ago
- Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assess…☆50Updated last month
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆77Updated 3 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- A PyTorch implementation of Conv-TasNet☆46Updated 5 years ago
- End-to-end waveform utterance enhancement for direct evaluation metrics optimization by fully convolutional neural networks (TASLP 2018)☆18Updated 5 years ago
- ☆91Updated 3 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆101Updated 3 years ago
- Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python☆46Updated 4 years ago
- ☆51Updated 5 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement☆37Updated 4 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆44Updated 5 years ago