tigthor / Voice-Cloning-AILinks
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. Feel free to check my thesis if you're curious or if you're looking for info I haven't documented.
☆34Updated 3 years ago
Alternatives and similar repositories for Voice-Cloning-AI
Users that are interested in Voice-Cloning-AI are comparing it to the libraries listed below
Sorting:
- an improved version of Real-time-voice-cloning☆49Updated last year
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- ☆22Updated 3 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆170Updated 4 years ago
- Text prompt steered synthetic audio generators☆47Updated 2 months ago
- AudioLDM text to audio colab☆19Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Updated 8 months ago
- Voice cloning AI (deepfake for voice). Using cloned voice from only 5-10 seconds of targeted voice.☆65Updated 3 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆157Updated 3 years ago
- Easy Voice Cloning (Addons for RVC)☆30Updated last year
- Voice clone application in flask, forked version of CorentinJ Voice Cloning☆21Updated 4 years ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆36Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Updated 2 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Updated 2 years ago
- One Shot Voice Cloning base on Unet-TTS☆242Updated 3 years ago
- Community framework for training tortoise☆42Updated 2 years ago
- Fork of AudioLDM as a TuneFlow plugin☆41Updated 2 years ago
- Audio datasets, easier.☆84Updated last year
- [WIP] VoiceSmith makes training text to speech models easy.☆225Updated 2 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆128Updated 2 years ago
- ☆27Updated last year
- An audio deepfake is when a “cloned” voice that is potentially indistinguishable from the real person’s is used to produce synthetic audi…☆63Updated last year
- Implementation of Emo-StarGAN☆45Updated last year
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆209Updated last year
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆60Updated 2 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆125Updated 3 years ago
- Official Implementation of StyleTTS☆435Updated 5 months ago