tigthor / Voice-Cloning-AI
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. Feel free to check my thesis if you're curious or if you're looking for info I haven't documented.
☆34Updated 3 years ago
Alternatives and similar repositories for Voice-Cloning-AI
Users who are interested in Voice-Cloning-AI are comparing it to the libraries listed below
- Voice cloning AI (deepfake for voice). Uses a voice cloned from only 5-10 seconds of the target voice.☆66Updated 3 years ago
- Text prompt steered synthetic audio generators☆47Updated 3 months ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆170Updated 4 years ago
- An audio deepfake is when a “cloned” voice that is potentially indistinguishable from the real person’s is used to produce synthetic audi…☆63Updated last year
- An improved version of Real-time-voice-cloning☆50Updated last year
- ☆22Updated 3 years ago
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API.☆53Updated 2 years ago
- Copy the voice of anyone☆50Updated 8 years ago
- One Shot Voice Cloning based on Unet-TTS☆242Updated 3 years ago
- [WIP] VoiceSmith makes training text to speech models easy.☆225Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the…☆47Updated 2 years ago
- Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into Your Favorite Celebrity's or Your Customized Voice. Our Cutting…☆55Updated 3 months ago
- Audio datasets, easier.☆84Updated last year
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Voice activated Python interface for Bard AI. Implements open sourced reverse engineered Bard API, local text to speech and OpenAI Whispe…☆60Updated last year
- ☆18Updated 2 years ago
- Community framework for training tortoise☆43Updated 2 years ago
- AudioLDM text to audio colab☆19Updated last year
- Prompts for Music Generation☆32Updated last year
- RVC Inference with multiple model and huggingface support☆106Updated last year
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆20Updated 2 months ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- Fork of AudioLDM as a TuneFlow plugin☆42Updated 2 years ago
- Desktop application for neural speech synthesis written in C++☆215Updated 2 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆362Updated 2 years ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and uses CUDA for …☆13Updated 9 months ago
- AI-generated music video with Riffusion and Gradio☆21Updated 2 years ago
- Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.☆98Updated 2 years ago
- A curated list of awesome OpenAI Whisper resources☆101Updated last year
- 🐸 - A general purpose model trainer, as flexible as it gets☆220Updated last year