liuhaozhe6788 / voice-cloning-collab
an improved version of Real-time-voice-cloning
☆50Updated last year
Alternatives and similar repositories for voice-cloning-collab:
Users that are interested in voice-cloning-collab are comparing it to the libraries listed below
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- ☆36Updated last year
- GUI to sync video mouth movements to match audio, utilizing wav2lip-hq. Completed as part of a technical interview.☆11Updated 11 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆35Updated 2 years ago
- Text prompt steered synthetic audio generators☆46Updated 2 weeks ago
- Code for the project: "Audio-Driven Video-Synthesis of Personalised Moderations"☆20Updated last year
- optimized wav2lip☆19Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 8 months ago
- Voice clone application in flask, forked version of CorentinJ Voice Cloning☆21Updated 4 years ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆66Updated last year
- ☆83Updated 10 months ago
- Official Implementation of StyleTTS-VC☆178Updated 3 months ago
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- Speech to Facial Animation using GANs☆40Updated 3 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- Video to video translation via few shot voice cloning & audio-based lip sync☆25Updated 10 months ago
- ☆27Updated last year
- One Shot Voice Cloning base on Unet-TTS☆241Updated 3 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆129Updated last year
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 2 years ago
- One-shot face animation using webcam, capable of running in real time.☆37Updated 11 months ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆201Updated 2 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 6 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆69Updated 10 months ago
- AudioLDM text to audio colab☆19Updated last year
- lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based…☆118Updated 3 months ago
- ☆74Updated 2 years ago
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆13Updated 4 years ago
- chinese real time voice cloning☆39Updated 5 years ago
- Create training data for training a voice cloner for bark text to speech.☆44Updated last year