lsh950919 / sv2tts
☆12 Updated 4 years ago
Alternatives and similar repositories for sv2tts
Users that are interested in sv2tts are comparing it to the libraries listed below
- Creates video from TTS output and viseme images. ☆11 Updated 2 years ago
- Speech to Facial Animation using GANs ☆40 Updated 3 years ago
- ☆40 Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and uses CUDA for … ☆13 Updated 7 months ago
- An improved version of Real-time-voice-cloning ☆50 Updated last year
- Auto-Video maker handling many AI's ☆10 Updated last year
- ☆13 Updated last year
- Grayscale SAEHD model and mode for training deepfakes. Notes, tests, experience, tools, study and explanations of the source code. ☆49 Updated 11 months ago
- ☆27 Updated last year
- Code for the project: "Audio-Driven Video-Synthesis of Personalised Moderations" ☆20 Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆16 Updated last year
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit… ☆34 Updated 2 years ago
- A python library to find differences between audio and transcriptions ☆20 Updated last year
- ☆37 Updated 7 months ago
- AudioLDM text to audio colab ☆19 Updated last year
- Automatically generate a lip-synced avatar based off of a transcript and audio ☆13 Updated 2 years ago
- ☆55 Updated last year
- Implementing an interactive AI avatar using Python, Blender and GPT ☆10 Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆16 Updated 3 weeks ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support. ☆14 Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations ☆42 Updated last month
- GAN deep learning model to use AI generated faces from /gan_facegenerator, turns them into cartoon characters, and animates them. ☆16 Updated last year
- This repository accompanies the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio… ☆14 Updated 4 years ago
- ☆23 Updated last year
- Swap faces with AI from a source image to a destination medium. Img2Img, Img2GIF, & Img2MP4 ☆29 Updated last year
- Talking head animation ☆27 Updated last year
- Streamlit app to visualize and edit TTS datasets ☆14 Updated 3 years ago
- Cog wrapper for IP-Adapter-FaceID ☆19 Updated last year
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation ☆17 Updated 2 years ago
- StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and imp… ☆11 Updated last year