v-nhandt21/ViSV2TTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/v-nhandt21/ViSV2TTS)

v-nhandt21 / ViSV2TTS

Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS

☆56

Alternatives and similar repositories for ViSV2TTS

Users that are interested in ViSV2TTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

v-nhandt21 / Viphoneme
View on GitHub
Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA
☆109Jun 21, 2024Updated 2 years ago
v-nhandt21 / Vinorm
View on GitHub
Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…
☆68Jan 1, 2025Updated last year
v-nhandt21 / ViMFA
View on GitHub
Montreal Forced Aligner for Vietnamese
☆15Oct 23, 2023Updated 2 years ago
NTT123 / light-speed
View on GitHub
A modified VITS that utilizes phoneme duration's ground truth for better robustness
☆158Aug 27, 2023Updated 2 years ago
NTT123 / vietTTS
View on GitHub
Vietnamese Text to Speech library
☆257Aug 20, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
thinhlpg / vixtts-demo
View on GitHub
A Vietnamese Voice Cloning Text-to-Speech Model ✨
☆517Apr 4, 2025Updated last year
dangvansam / viet-tts
View on GitHub
VietTTS: An Open-Source Vietnamese Text to Speech
☆88Dec 23, 2025Updated 7 months ago
nguyenvulebinh / VietVoice-TTS
View on GitHub
A Vietnamese Text-to-Speech library that provides high-quality speech synthesis with voice cloning capabilities
☆105Jul 14, 2025Updated last year
NTT123 / Vietnamese-Text-To-Speech-Dataset
View on GitHub
A synthesized dataset for Vietnamese TTS task
☆66May 6, 2022Updated 4 years ago
phineas-pta / speech-synthesis-ngngngan
View on GitHub
python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn
☆15Aug 13, 2024Updated last year
telexyz / GPT4VN
View on GitHub
Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu
☆112Jun 10, 2023Updated 3 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
iamdinhthuan / Vira-tts
View on GitHub
☆25Feb 3, 2026Updated 5 months ago
BiSinger-SVS / BiSinger
View on GitHub
Bilingual Singing Voice Synthesis
☆18Mar 25, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
VinAIResearch / VinAI_Translate
View on GitHub
A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)
☆142Jul 22, 2024Updated 2 years ago
nguyenvulebinh / visen
View on GitHub
ViSen is library to format tone of Vietnamese sentences
☆22Nov 9, 2021Updated 4 years ago
dangtr0408 / StyleTTS2-lite
View on GitHub
A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.
☆50May 22, 2025Updated last year
heraclex12 / vietpunc
View on GitHub
Vietnamese Punctuation Prediction using Pretrained Language Models
☆14May 8, 2022Updated 4 years ago
TuananhCR / Dia-Finetuning-Vietnamese
View on GitHub
TTS Dia finetuning for Vietnamese
☆128Dec 3, 2025Updated 7 months ago
DongKeon / webrtc-whisper-asr
View on GitHub
WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.
☆13Sep 27, 2024Updated last year
freds0 / free-svc
View on GitHub
[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion
☆95Jul 23, 2025Updated last year
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
monokaijs / fb-story-downloader
View on GitHub
☆21Jul 6, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CodeLinkIO / Vietnamese-text-normalization
View on GitHub
☆17Jul 6, 2023Updated 3 years ago
nguyenthienhy / F5-TTS-Vietnamese
View on GitHub
☆161Apr 23, 2025Updated last year
phatjkk / vits-tts-vietnamese
View on GitHub
Fine-tuning Vietnamese Text-to-speech model (VITS)
☆66Mar 18, 2025Updated last year
AI-Unicamp / TTS-Objective-Metrics
View on GitHub
Objective metrics used in several text-to-speech (TTS) papers.
☆54Jun 17, 2025Updated last year
pbcquoc / vietnamese_word_seperate
View on GitHub
Seperate vietnamese using lstm
☆18Aug 17, 2018Updated 7 years ago
kjw11 / Speaker-Aware-CTC
View on GitHub
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22May 26, 2025Updated last year
tatianapassali / artificial-disfluency-generation
View on GitHub
Generating artificial disfluencies from fluent text easily and promptly
☆16Sep 28, 2022Updated 3 years ago
gitmylo / bark-data-gen
View on GitHub
Create training data for training a voice cloner for bark text to speech.
☆47Jun 13, 2023Updated 3 years ago
souvikg544 / TTS_Data_Maker
View on GitHub
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…
☆28Mar 14, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
v-nhandt21 / MusicVoiceConversion
View on GitHub
Sing any popular song with your voice
☆11Jul 10, 2022Updated 4 years ago
Mikxox / EnCodec_Trainer
View on GitHub
☆67Apr 3, 2023Updated 3 years ago
ShmuelRonen / ComfyUI-Veo2-Experimental
View on GitHub
A custom node extension for ComfyUI that integrates Google's Veo 2 text-to-video generation capabilities.
☆33Apr 12, 2025Updated last year
fgnt / speaker_reassignment
View on GitHub
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
☆14Feb 5, 2025Updated last year
kjw11 / CSEnet-ASR
View on GitHub
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12Mar 14, 2025Updated last year
ozspeech / OZSpeech
View on GitHub
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45Feb 9, 2025Updated last year
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago