rishikksh20 / TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
☆86Updated 3 years ago
Related projects: ⓘ
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆101Updated 3 years ago
- Implementation of the AlignTTS☆76Updated last year
- ☆45Updated 4 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆133Updated 2 years ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Updated 3 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆79Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆67Updated 3 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 2 years ago
- Alignment files of LibriTTS.☆57Updated 4 years ago
- A pytroch implementation of the FB-MelGAN☆84Updated 4 years ago
- PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer …☆31Updated 3 years ago
- Pytorch implementation of subband decomposition☆88Updated 2 years ago
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis☆71Updated 3 years ago
- MelGAN implementation with Multi-Band and Full Band supports...☆60Updated 4 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Updated 2 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆73Updated last year
- VAE Tacotron 2, an alternative of GST Tacotron☆85Updated last year
- ☆54Updated 3 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆151Updated 2 years ago
- Voice conversion (VC) investigation using three variants of VAE☆56Updated 4 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated last year
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆86Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆81Updated last year
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆52Updated last year
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆187Updated last year
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆116Updated 2 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆91Updated last year
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago