EndlessReform / smoltts
Open TTS models, built for streaming on the edge
☆43Updated 3 months ago
Alternatives and similar repositories for smoltts
Users who are interested in smoltts are comparing it to the libraries listed below.
- StyleTTS 2 Optimized Training Fork☆31Updated 4 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆90Updated last month
- Audio tokenization, in the fastest way possible!☆52Updated 10 months ago
- ☆62Updated 11 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆24Updated last month
- Joint speech-language model - respond directly to audio!☆30Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 3 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆99Updated 8 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆85Updated 3 weeks ago
- ☆15Updated 3 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 7 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆14Updated last year
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (like fish/cosy/chattts).☆77Updated this week
- ☆40Updated 4 months ago
- High quality text-to-speech based on StyleTTS 2.☆51Updated 2 weeks ago
- ☆22Updated this week
- An unofficial PyTorch implementation of VALL-E☆87Updated 3 weeks ago
- ☆50Updated 2 months ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year
- Official Code for ParrotTTS☆51Updated 8 months ago
- StyleTTS2 + Vocos as a Decoder☆12Updated 3 months ago
- The demo page of UniAudio☆34Updated last year
- VoiceBox neural network implementation☆109Updated 10 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated this week
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated last month
- A collection of all our phonemizers for dataset construction and inference☆24Updated 4 months ago
- ☆20Updated 2 years ago
- ☆25Updated last year
- Automatically cleans, enhances, segments, filters, and formats a dataset to fine-tune or train a voice model.☆39Updated last week
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆38Updated last week