bshall / hifigan
A 16 kHz implementation of HiFi-GAN for soft-vc.
☆98 · Updated last year
Alternatives and similar repositories for hifigan:
Users who are interested in hifigan are comparing it to the repositories listed below:
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆85 · Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆132 · Updated last year
- Official implementation of SpeechSplit2 ☆132 · Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆123 · Updated 2 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022) ☆116 · Updated last year
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis. ☆88 · Updated 3 years ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022) ☆116 · Updated 2 years ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data" ☆134 · Updated last year
- PyTorch Implementation of Multi-Singer (ACM-MM'21) ☆138 · Updated 2 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆120 · Updated 2 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ☆146 · Updated last year
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model. ☆145 · Updated 3 years ago
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice" ☆97 · Updated 2 years ago
- ☆117 · Updated 2 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm… ☆117 · Updated 3 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS ☆163 · Updated last year
- Evaluation and Benchmarking of Speech Super-resolution Methods ☆149 · Updated 2 years ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion ☆144 · Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion ☆103 · Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis ☆147 · Updated 2 years ago
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training ☆134 · Updated 2 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis ☆104 · Updated 3 years ago
- Official implementation of the source-filter HiFiGAN vocoder ☆251 · Updated last year
- ☆112 · Updated 3 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit… ☆78 · Updated 2 years ago
- Train the next generation of TTS systems. ☆165 · Updated 7 months ago
- demo page https://MingjieChen.github.io/dygan-vc ☆67 · Updated 3 years ago
- Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech" ☆190 · Updated last year
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se… ☆84 · Updated 2 years ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No… ☆113 · Updated 4 years ago