noicevice / awesome-voice-cloning
☆64Updated 4 years ago
Alternatives and similar repositories for awesome-voice-cloning:
Users that are interested in awesome-voice-cloning are comparing it to the libraries listed below
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- ☆129Updated last year
- A gui to help make a text to speech dataset.☆18Updated 2 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 5 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆53Updated 2 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆129Updated 3 years ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- [WIP] VoiceSmith makes training text to speech models easy.☆224Updated 2 years ago
- Community framework for training tortoise☆40Updated 2 years ago
- Desktop application for neural speech synthesis written in C++☆213Updated 2 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆359Updated last year
- Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)☆188Updated 11 months ago
- General Speech Restoration☆275Updated last year
- Tools to create your own voice dataset for TTS training☆66Updated 4 years ago
- Your one-stop solution for voice dataset creation☆117Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆81Updated last year
- One Shot Voice Cloning base on Unet-TTS☆241Updated 2 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆170Updated 7 months ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 5 years ago
- Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher☆178Updated last year
- Multi-voice singing voice synthesis☆236Updated last year
- DLAS - A configuration-driven trainer for generative models☆138Updated 2 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆319Updated 7 months ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆168Updated 4 years ago
- Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach☆67Updated 2 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆368Updated 6 years ago
- Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structur…☆90Updated 7 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago