fakerybakery / OpenF5-TTSLinks
(WIP) A retrain of F5-TTS on permissively-licensed data
☆13Updated 9 months ago
Alternatives and similar repositories for OpenF5-TTS
Users that are interested in OpenF5-TTS are comparing it to the libraries listed below
Sorting:
- StyleTTS 2 Optimized Training Fork☆33Updated 11 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 9 months ago
- High quality text-to-speech based on StyleTTS 2.☆71Updated last month
- ☆25Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Updated 8 months ago
- ☆14Updated last year
- LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Ital…☆66Updated this week
- ☆50Updated 6 months ago
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆56Updated 5 months ago
- Collection of scripts from mHuBERT-147.☆32Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20Updated 7 months ago
- An unofficial PyTorch implementation of VALL-E☆88Updated 5 months ago
- ☆28Updated 2 years ago
- pytorch model for contexless-phoneme prediction from speech audio☆30Updated 2 months ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆62Updated 3 months ago
- Zero-Shot Emotion Style Transfer☆49Updated 8 months ago
- ☆29Updated 11 months ago
- ☆61Updated 2 years ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆43Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35Updated 8 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated last year
- Official PyTorch implementation of (ICME2025) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speec…☆17Updated 10 months ago
- speaker-disentangled speech linguistic content quantizer☆24Updated 10 months ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆143Updated 3 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Updated last year
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆141Updated 7 months ago
- ☆58Updated last year
- My vocoder experiments☆31Updated 5 months ago
- A TTS Trained on Universal Audio.☆41Updated 7 months ago