iamdinhthuan / index-tts-finetune-vietnameseLinks
☆38Updated 2 months ago
Alternatives and similar repositories for index-tts-finetune-vietnamese
Users that are interested in index-tts-finetune-vietnamese are comparing it to the libraries listed below
Sorting:
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆52Updated 7 months ago
- EraX Text to Speech base on F5-TTS Base V1☆79Updated 8 months ago
- VietTTS: An Open-Source Vietnamese Text to Speech☆79Updated 3 weeks ago
- ViStreamASR - Real-Time Vietnamese Speech Recognition☆52Updated 6 months ago
- SoTA open-source TTS☆131Updated 7 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆127Updated 5 months ago
- finetune llm part for spark-tts model☆120Updated 9 months ago
- Echo-TTS inference codebase☆75Updated last month
- ☆296Updated 5 months ago
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆57Updated 2 years ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆86Updated last year
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆132Updated 5 months ago
- ☆245Updated 3 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆218Updated 8 months ago
- A Vietnamese Text-to-Speech library that provides high-quality speech synthesis with voice cloning capabilities☆101Updated 6 months ago
- Fast audio super resolution from 16khz to 48khz.☆177Updated 2 weeks ago
- ☆54Updated this week
- ☆133Updated 8 months ago
- An Enhanced Version of Piper especially for Vietnamese :)☆24Updated 8 months ago
- A high quality and fast TTS repository☆461Updated 3 weeks ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 9 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆47Updated 4 months ago
- PersonaPlex code.☆110Updated this week
- Running the F5-TTS by ONNX Runtime☆190Updated last week
- ☆186Updated last year
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…☆140Updated last month
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆81Updated last year
- High quality text-to-speech based on StyleTTS 2.☆71Updated last month
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆196Updated 3 months ago
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency☆181Updated 2 months ago