nguyenthienhy / F5-TTS-VietnameseView external linksLinks
☆136Apr 23, 2025Updated 9 months ago
Alternatives and similar repositories for F5-TTS-Vietnamese
Users that are interested in F5-TTS-Vietnamese are comparing it to the libraries listed below
Sorting:
- ☆40Nov 19, 2025Updated 2 months ago
- A Vietnamese Text-to-Speech library that provides high-quality speech synthesis with voice cloning capabilities☆101Jul 14, 2025Updated 7 months ago
- ☆43Sep 3, 2025Updated 5 months ago
- VietASR - Vietnamese Automatic Speech Recognition☆163Oct 29, 2024Updated last year
- ViStreamASR - Real-Time Vietnamese Speech Recognition☆52Jul 12, 2025Updated 7 months ago
- Vietnamese Text to Speech library☆251Aug 20, 2023Updated 2 years ago
- finetune llm part for spark-tts model☆120Mar 25, 2025Updated 10 months ago
- ☆11Apr 25, 2025Updated 9 months ago
- TTS Dia finetuning for Vietnamese☆123Dec 3, 2025Updated 2 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆52May 22, 2025Updated 8 months ago
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription☆77Updated this week
- End to End Speech to Speech with Emotion System☆15Feb 6, 2025Updated last year
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆57Dec 1, 2023Updated 2 years ago
- EraX Text to Speech base on F5-TTS Base V1☆79May 8, 2025Updated 9 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)☆194Nov 12, 2024Updated last year
- ☆11Jan 1, 2024Updated 2 years ago
- Dự án công cụ chuyển đổi giọng nói dành cho người Việt☆24Feb 9, 2026Updated last week
- A Python library for text normalization, specifically designed for Vietnamese and English text processing. This library provides comprehe…☆13Mar 30, 2025Updated 10 months ago
- ☆14Aug 19, 2024Updated last year
- In this repository I will be running various experiments on finetune different parts for xtts☆15Jun 22, 2024Updated last year
- AI-powered tool that transforms STEM concepts into narrated educational animations using Manim, LLMs, and multimodal AI☆74Oct 4, 2025Updated 4 months ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 5 months ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Oct 5, 2023Updated 2 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆66Jan 1, 2025Updated last year
- ☆14Jul 24, 2025Updated 6 months ago
- A modified VITS that utilizes phoneme duration's ground truth for better robustness☆152Aug 27, 2023Updated 2 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆68Jan 22, 2024Updated 2 years ago
- Visual Speech Recongnition☆19Dec 24, 2024Updated last year
- ☆27Jun 12, 2025Updated 8 months ago
- Fine-tuning Vietnamese Text-to-speech model (VITS)☆55Mar 18, 2025Updated 10 months ago
- ToRoLaMa: The Vietnamese Instruction-Following and Chat Model☆24Jan 4, 2024Updated 2 years ago
- Voice conversion with just linear regression.☆33Sep 25, 2025Updated 4 months ago
- ☆28Jun 5, 2024Updated last year
- Plugins for XaviaBot by everyone!☆11Jun 22, 2024Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26May 14, 2024Updated last year
- ☆55Apr 2, 2025Updated 10 months ago
- Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"☆41Jun 28, 2025Updated 7 months ago