tigthor / Voice-Cloning-AI
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. Feel free to check my thesis if you're curious or if you're looking for info I haven't documented.
☆36Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Voice-Cloning-AI
- A properly working gui for First order model. Attempts to fully install, requires visual studio sdk for c++ 2017☆10Updated last year
- Text prompt steered synthetic audio generators☆45Updated 11 months ago
- an improved version of Real-time-voice-cloning☆45Updated 8 months ago
- A software pipeline for creating realistic videos of people talking, using only images.☆38Updated 3 years ago
- Voice cloning AI (deepfake for voice). Using cloned voice from only 5-10 seconds of targeted voice.☆52Updated 3 years ago
- Voice clone application in flask, forked version of CorentinJ Voice Cloning☆21Updated 3 years ago
- An audio deepfake is when a “cloned” voice that is potentially indistinguishable from the real person’s is used to produce synthetic audi…☆55Updated 8 months ago
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API.☆49Updated last year
- This is 3D Avatar chat bot based on Open AI GPT3, and google Text to speech integration and oculus lipsync. you feel like you are interac…☆24Updated 2 years ago
- Based on Tortoise-TTS and Real-Time-Voice-Cloning for Python, a C# Text-To-Speech program to clone the voice of other people with TTS.☆18Updated 3 months ago
- AudioLDM text to audio colab☆19Updated last year
- ☆21Updated 2 years ago
- Ai generated music video with Riffusion and Gradio☆19Updated last year
- An easy-to-use Video generation colab notebook☆13Updated last year
- Speech to Facial Animation using GANs☆41Updated 3 years ago
- [WIP] VoiceSmith makes training text to speech models easy.☆222Updated 2 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 5 years ago
- ☆11Updated 3 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆167Updated 4 years ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆33Updated 2 years ago
- Your one-stop solution for voice dataset creation☆112Updated 11 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆46Updated last year
- Video Translation with LipSync with OpenAi's whisper for ASR, YourTTS for TTS, and Wav2lip for lip sync.☆15Updated last year
- Uses ChatGPT, TTS, and Stable Diffusion to automatically generate videos☆28Updated last year
- text-to-audio-latent-diffusion☆34Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆31Updated 2 years ago
- Copy the voice of anyone☆50Updated 7 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 2 years ago
- A combination of various deepfake algoritms to quickly create fake audio and video☆21Updated 4 years ago