A Python/Pytorch app for easily synthesising human voices
☆1,443Dec 2, 2024Updated last year
Alternatives and similar repositories for Voice-Cloning-App
Users that are interested in Voice-Cloning-App are comparing it to the libraries listed below
Sorting:
- One Shot Voice Cloning base on Unet-TTS☆244Mar 22, 2022Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆360Mar 25, 2023Updated 2 years ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆59,483Dec 15, 2025Updated 2 months ago
- A multi-voice TTS system trained with an emphasis on quality☆14,820Nov 19, 2024Updated last year
- This repository has implementation for "Neural Voice Cloning With Few Samples"☆436Feb 23, 2021Updated 5 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.☆844Oct 10, 2023Updated 2 years ago
- A web app that lets you play around with TalkNet models☆124Jul 31, 2023Updated 2 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆171Sep 25, 2020Updated 5 years ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆3,341Aug 24, 2025Updated 6 months ago
- [WIP] VoiceSmith makes training text to speech models easy.☆229Oct 10, 2022Updated 3 years ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆44,763Aug 16, 2024Updated last year
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆1,052Nov 4, 2024Updated last year
- Fast TorToiSe inference (5x or your money back!)☆829Jul 10, 2024Updated last year
- The code for the bark-voicecloning model. Training and inference.☆710Sep 13, 2023Updated 2 years ago
- A webui for different audio related Neural Networks☆1,236May 19, 2025Updated 9 months ago
- Phoneme multilingual(Russian-English) voice cloning based on☆398Feb 7, 2021Updated 5 years ago
- TorToiSe fine-tuning with DLAS☆226Aug 1, 2024Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆5,304Jun 12, 2024Updated last year
- Voice clone application in flask, forked version of CorentinJ Voice Cloning☆21Jan 28, 2021Updated 5 years ago
- This repository does not contain code, its purpose it for issue tracking and wiki☆409May 2, 2023Updated 2 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- singing voice change based on whisper, and lora for singing voice clone☆648Nov 3, 2023Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- A python package to analyze and compare voices with deep learning☆3,225Oct 12, 2023Updated 2 years ago
- General Speech Restoration☆1,293Feb 17, 2025Updated last year
- An arbitrary face-swapping framework on images and videos with one single trained model!☆5,124Aug 6, 2024Updated last year
- General Speech Restoration☆283Jan 13, 2024Updated 2 years ago
- Demo for 2022 ICASSP☆64Jun 14, 2022Updated 3 years ago
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆237Feb 29, 2024Updated 2 years ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)☆10,118Nov 9, 2023Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆254May 19, 2022Updated 3 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆2,324Jul 27, 2024Updated last year
- [CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.☆3,598Feb 10, 2024Updated 2 years ago
- An unofficial PyTorch implementation of the audio LM VALL-E☆2,992May 10, 2023Updated 2 years ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆12,852Jun 22, 2025Updated 8 months ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,389Jun 6, 2024Updated last year
- 🔊 Text-Prompted Generative Audio Model☆39,039Aug 19, 2024Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Jul 24, 2023Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆866Jul 22, 2023Updated 2 years ago