anhnh2002/XTTSv2-Finetuning-for-New-Languages

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/anhnh2002/XTTSv2-Finetuning-for-New-Languages)

anhnh2002 / XTTSv2-Finetuning-for-New-Languages

☆205

Alternatives and similar repositories for XTTSv2-Finetuning-for-New-Languages

Users that are interested in XTTSv2-Finetuning-for-New-Languages are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tuanh123789 / Train_Hifigan_XTTS
View on GitHub
This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.
☆87Nov 12, 2024Updated last year
idiap / coqui-ai-Trainer
View on GitHub
🐸 - A general purpose model trainer, as flexible as it gets
☆16Apr 10, 2026Updated 3 months ago
v-nhandt21 / ViMFA
View on GitHub
Montreal Forced Aligner for Vietnamese
☆15Oct 23, 2023Updated 2 years ago
nii-yamagishilab / ZMM-TTS
View on GitHub
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
☆183Mar 6, 2024Updated 2 years ago
tuanh123789 / Spark-TTS-finetune
View on GitHub
finetune llm part for spark-tts model
☆125Mar 25, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
stlohrey / chatterbox-finetuning
View on GitHub
SoTA open-source TTS
☆136Jun 7, 2025Updated last year
idiap / coqui-ai-TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆2,295Jun 10, 2026Updated last month
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago
freds0 / kabooks
View on GitHub
KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…
☆13Mar 24, 2023Updated 3 years ago
e-c-k-e-r / vall-e
View on GitHub
An unofficial PyTorch implementation of VALL-E
☆88Aug 3, 2025Updated 11 months ago
gauthelo / kallaama-speech-dataset
View on GitHub
A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.
☆20Mar 26, 2026Updated 3 months ago
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
zhenye234 / LLaSA_training
View on GitHub
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆660Jan 21, 2026Updated 6 months ago
stlohrey / dia-finetuning
View on GitHub
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆131Jul 25, 2025Updated 11 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
neulab / AfricanVoices
View on GitHub
Hosts text-to-speech corpus and speech synthesizers for African languages.
☆19May 31, 2023Updated 3 years ago
anhnh2002 / vnhtr
View on GitHub
A Vietnamese handwriting recognition project
☆16Feb 21, 2024Updated 2 years ago
Choddeok / EmoSpherepp
View on GitHub
[TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…
☆129Updated this week
mehedihasanbijoy / DPCSpell
View on GitHub
[Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages
☆14Aug 9, 2024Updated last year
mbzuai-nlp / sttatts
View on GitHub
☆31Oct 29, 2024Updated last year
alpoktem / bible2speechDB
View on GitHub
Scripts to create speech corpora from open.bible
☆13Jan 3, 2022Updated 4 years ago
thinhlpg / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆22Apr 7, 2024Updated 2 years ago
dangvansam / viet-tts
View on GitHub
VietTTS: An Open-Source Vietnamese Text to Speech
☆88Dec 23, 2025Updated 6 months ago
ylacombe / finetune-hf-vits
View on GitHub
Finetune VITS and MMS using HuggingFace's tools
☆202Mar 31, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MahmoudAshraf97 / ctc-forced-aligner
View on GitHub
Text to speech alignment using CTC forced alignment
☆523Jul 12, 2026Updated last week
daswer123 / xtts-webui
View on GitHub
Webui for using XTTS and for finetuning it
☆890Jan 17, 2025Updated last year
liuhuang31 / HiFTNet-sr
View on GitHub
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24Jan 2, 2024Updated 2 years ago
v-nhandt21 / Viphoneme
View on GitHub
Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA
☆109Jun 21, 2024Updated 2 years ago
daswer123 / xtts-finetune-webui
View on GitHub
Slightly improved official version for finetune xtts
☆393Apr 3, 2025Updated last year
nguyenthienhy / F5-TTS-Vietnamese
View on GitHub
☆161Apr 23, 2025Updated last year
Edresson / ZS-TTS-Evaluation
View on GitHub
☆45Sep 19, 2024Updated last year
davidmartinrius / speech-dataset-generator
View on GitHub
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
☆262Jun 10, 2024Updated 2 years ago
EZ-VC / EZ-VC
View on GitHub
[EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion
☆43Sep 9, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
p0p4k / vits2_pytorch
View on GitHub
unofficial vits2-TTS implementation in pytorch
☆548Mar 28, 2024Updated 2 years ago
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
daniilrobnikov / vits2
View on GitHub
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
☆642Sep 11, 2023Updated 2 years ago
Vyvo-Labs / VyvoTTS
View on GitHub
VyvoTTS: LLM-Based Text-to-Speech Training Framework
☆257Apr 8, 2026Updated 3 months ago
theodorblackbird / lina-speech
View on GitHub
Official implementation of the TTS model Lina-Speech
☆178Jan 9, 2025Updated last year
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,016Dec 2, 2025Updated 7 months ago
sanchit-gandhi / whisper-flash-attention
View on GitHub
☆21Mar 7, 2023Updated 3 years ago