stlohrey / chatterbox-finetuningLinks
SoTA open-source TTS
☆28Updated 2 weeks ago
Alternatives and similar repositories for chatterbox-finetuning
Users that are interested in chatterbox-finetuning are comparing it to the libraries listed below
Sorting:
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆21Updated last month
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated last month
- ☆33Updated 2 months ago
- High quality text-to-speech based on StyleTTS 2.☆51Updated last week
- ☆50Updated 2 months ago
- StyleTTS 2 Optimized Training Fork☆31Updated 4 months ago
- StyleTTS2 + Vocos as a Decoder☆12Updated 2 months ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆26Updated 2 weeks ago
- The Vokan Architecture (Tsukasa speech based)☆9Updated 4 months ago
- ☆15Updated last month
- Official Code for ParrotTTS☆51Updated 8 months ago
- A collection of all our phonemeizers for dataset construction and inference☆23Updated 4 months ago
- ☆26Updated 7 months ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Updated 3 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 3 weeks ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆55Updated 8 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆18Updated last month
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆26Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆16Updated last month
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆36Updated 4 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆60Updated 2 months ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆19Updated 4 months ago
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"☆32Updated last week
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆42Updated 6 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆101Updated 5 months ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆26Updated 4 months ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆29Updated last year
- Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆44Updated 2 weeks ago
- Supervoice diffusion enhance☆27Updated 11 months ago