MartinMashalov/VoiceCloning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MartinMashalov/VoiceCloning)

MartinMashalov / VoiceCloning

Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the YourTTS TTS model to clone and generate realistic audio waves

☆47

Alternatives and similar repositories for VoiceCloning

Users that are interested in VoiceCloning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

seahore / PPG-GradVC
View on GitHub
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
☆45Jul 24, 2023Updated 3 years ago
CMsmartvoice / One-Shot-Voice-Cloning
View on GitHub
One Shot Voice Cloning base on Unet-TTS
☆243Mar 22, 2022Updated 4 years ago
keonlee9420 / Robust_Fine_Grained_Prosody_Control
View on GitHub
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41Feb 20, 2022Updated 4 years ago
walker-hyf / FCTalker
View on GitHub
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)
☆26Feb 22, 2024Updated 2 years ago
justinjohn0306 / SpeedScribe
View on GitHub
High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…
☆10Sep 17, 2025Updated 10 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
IEEE-NITK / Neural-Voice-Cloning
View on GitHub
Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…
☆58Mar 23, 2019Updated 7 years ago
liuhaozhe6788 / voice-cloning-collab
View on GitHub
an improved version of Real-time-voice-cloning
☆52Mar 6, 2024Updated 2 years ago
deterministic-algorithms-lab / Cross-Lingual-Voice-Cloning
View on GitHub
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
☆359Mar 25, 2023Updated 3 years ago
CODEJIN / HierSpeech
View on GitHub
☆67Jul 16, 2023Updated 3 years ago
hcy71o / LPC_Speech_Synthesis
View on GitHub
Speech synthesis using LPC
☆25Jun 5, 2021Updated 5 years ago
lakahaga / dc-comix-tts
View on GitHub
Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer
☆74Aug 21, 2023Updated 2 years ago
fmiotello / fastVC
View on GitHub
A simple voice conversion tool
☆20Mar 10, 2022Updated 4 years ago
ronggong / phoneticSimilarity
View on GitHub
phonetic similarity algorithms
☆13Jun 19, 2018Updated 8 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
light1726 / BetaVAE_VC
View on GitHub
Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"
☆43Apr 10, 2023Updated 3 years ago
sooftware / tacotron2
View on GitHub
Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
☆19Jan 21, 2021Updated 5 years ago
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
Rongjiehuang / GenerSpeech
View on GitHub
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
☆333Feb 9, 2024Updated 2 years ago
KevinMIN95 / StyleSpeech
View on GitHub
Official implementation of Meta-StyleSpeech and StyleSpeech
☆254Feb 9, 2022Updated 4 years ago
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
CODEJIN / VITS_Diffusion
View on GitHub
☆26Sep 22, 2022Updated 3 years ago
kan-bayashi / LibriTTSLabel
View on GitHub
Alignment files of LibriTTS.
☆70Mar 16, 2020Updated 6 years ago
AverageMisesian / Janus_AI_WIN
View on GitHub
Face Swapping using Mediapipe and OpenCV
☆15Dec 5, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
madhu1995-oss / Pronunciation-and-Fluency-evaluation-using-machne-learning-and-DeepLearning
View on GitHub
☆13Apr 9, 2021Updated 5 years ago
jinny1208 / All-About-Speech
View on GitHub
☆14Apr 2, 2023Updated 3 years ago
echo3Dco / Unity-ARFoundation-echo3D-demo-Face-Change
View on GitHub
Simple face change demo with Unity, AR Foundation, and echo3D
☆12Sep 30, 2021Updated 4 years ago
LLM360 / website
View on GitHub
Website for LLM360
☆15Apr 27, 2026Updated 2 months ago
keonlee9420 / Comprehensive-E2E-TTS
View on GitHub
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…
☆147Jun 6, 2022Updated 4 years ago
hcy71o / SC-CNN
View on GitHub
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39Nov 1, 2023Updated 2 years ago
SolomidHero / real-time-voice-conversion
View on GitHub
Toolbox for easy and qualitative one-shot voice conversion
☆48Dec 5, 2021Updated 4 years ago
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47Dec 1, 2022Updated 3 years ago
b04901014 / UUVC
View on GitHub
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆83Jan 7, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Labmem-Zhouyx / CDFSE_FastSpeech2
View on GitHub
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86Dec 20, 2022Updated 3 years ago
revsic / torch-nansypp
View on GitHub
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
☆152Feb 11, 2023Updated 3 years ago
SungFeng-Huang / Meta-TTS
View on GitHub
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.
☆192Jun 8, 2023Updated 3 years ago
RejektsAI / EVC
View on GitHub
Easy Voice Cloning (Addons for RVC)
☆32Sep 10, 2023Updated 2 years ago
jinhan / tacotron2-gst
View on GitHub
Tacotron2 with Global Style Tokens
☆64Apr 19, 2019Updated 7 years ago
bshall / acoustic-model
View on GitHub
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
☆105Mar 10, 2026Updated 4 months ago
yl4579 / StyleTTS-VC
View on GitHub
Official Implementation of StyleTTS-VC
☆200Jan 14, 2025Updated last year