EraX Text to Speech based on F5-TTS Base V1
☆80 · May 8, 2025 · Updated 9 months ago
Alternatives and similar repositories for viF5TTS
Users that are interested in viF5TTS are comparing it to the libraries listed below
- An Enhanced Version of Piper especially for Vietnamese :) ☆28 · Apr 24, 2025 · Updated 10 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. … ☆13 · Dec 4, 2024 · Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆21 · Jun 7, 2025 · Updated 8 months ago
- ☆18 · Apr 28, 2021 · Updated 4 years ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆13 · Apr 6, 2025 · Updated 11 months ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl… ☆66 · Jan 1, 2025 · Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text-to-speech model. ☆52 · May 22, 2025 · Updated 9 months ago
- A Vietnamese Text-to-Speech library that provides high-quality speech synthesis with voice cloning capabilities ☆103 · Jul 14, 2025 · Updated 7 months ago
- ViStreamASR - Real-Time Vietnamese Speech Recognition ☆53 · Jul 12, 2025 · Updated 7 months ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022) ☆22 · Jun 5, 2025 · Updated 9 months ago
- ☆13 · Apr 18, 2025 · Updated 10 months ago
- ☆54 · Jul 16, 2025 · Updated 7 months ago
- text to speech ☆10 · Mar 19, 2024 · Updated last year
- ☆41 · Nov 19, 2025 · Updated 3 months ago
- Cog implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation" ☆12 · Apr 16, 2025 · Updated 10 months ago
- Onset-and-Offset-Aware Sound Event Detection ☆21 · Feb 10, 2025 · Updated last year
- ☆10 · Apr 8, 2024 · Updated last year
- ☆11 · Jan 1, 2024 · Updated 2 years ago
- TTS Dia finetuning for Vietnamese ☆124 · Dec 3, 2025 · Updated 3 months ago
- ☆11 · Mar 22, 2023 · Updated 2 years ago
- ☆14 · Aug 1, 2025 · Updated 7 months ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings. ☆25 · Dec 11, 2025 · Updated 2 months ago
- TTS Text Analyzer ☆31 · Jul 20, 2023 · Updated 2 years ago
- ☆32 · Jul 27, 2022 · Updated 3 years ago
- python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn ☆14 · Aug 13, 2024 · Updated last year
- PostHog with text analytics extensions, serving as an advanced LLM analytics platform. ☆15 · Sep 17, 2024 · Updated last year
- Manipulating semantic data within Python ☆18 · Jan 14, 2025 · Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆36 · Jun 6, 2024 · Updated last year
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset ☆15 · Apr 7, 2025 · Updated 10 months ago
- Implementation of the paper: StyleBERT: Text-Audio Sentiment Analysis with Bi-directional Style Enhancement ☆14 · Apr 10, 2023 · Updated 2 years ago
- StyleTTS2 + Vocos as a Decoder ☆13 · Mar 24, 2025 · Updated 11 months ago
- Sing any popular song with your voice ☆11 · Jul 10, 2022 · Updated 3 years ago
- Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis" ☆14 · Nov 5, 2024 · Updated last year
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors ☆36 · Feb 11, 2025 · Updated last year
- Landing Page for All Things Source Separation ☆36 · Sep 12, 2025 · Updated 5 months ago
- Creation of a multi user audio first annotation tool - GSoC 2021 ☆29 · Mar 30, 2023 · Updated 2 years ago
- ☆16 · Apr 4, 2022 · Updated 3 years ago
- unplugin-version-injector is a powerful and lightweight plugin that automatically injects the version number and build timestamp into all… ☆19 · Jun 6, 2025 · Updated 9 months ago