noicevice / awesome-voice-cloning
☆64Updated 4 years ago
Alternatives and similar repositories for awesome-voice-cloning:
Users that are interested in awesome-voice-cloning are comparing it to the libraries listed below
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- A gui to help make a text to speech dataset.☆18Updated 2 years ago
- ☆130Updated 2 years ago
- Community framework for training tortoise☆41Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆253Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Updated 2 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆359Updated 2 years ago
- Your one-stop solution for voice dataset creation☆118Updated last year
- Deep Learning technology to upscale music.☆21Updated 4 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated 8 months ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆53Updated 2 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 5 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated 2 years ago
- [WIP] VoiceSmith makes training text to speech models easy.☆224Updated 2 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 3 years ago
- Prosody Transfer Tacotron☆19Updated 6 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆319Updated 8 months ago
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated 11 months ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆228Updated 2 years ago
- A python library to generate speech dataset from Youtube videos☆36Updated 9 months ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- TorToiSe fine-tuning with DLAS☆218Updated 8 months ago
- DLAS - A configuration-driven trainer for generative models☆139Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- Multi-voice singing voice synthesis☆237Updated 2 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source t…☆67Updated 3 years ago
- generate granular word-level captions in srt format☆57Updated 2 years ago
- General Speech Restoration☆276Updated last year