EndlessReform / smoltts
Open TTS models, built for streaming on the edge
☆39Updated last month
Alternatives and similar repositories for smoltts:
Users that are interested in smoltts are comparing it to the libraries listed below
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 2 weeks ago
- Audio tokenization, in the fastest way possible!☆51Updated 8 months ago
- StyleTTS 2 Optimized Training Fork☆27Updated 2 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆104Updated 2 weeks ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆14Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 6 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 5 months ago
- ☆62Updated 9 months ago
- A collection of all our phonemeizers for dataset construction and inference☆22Updated 2 months ago
- a Frontier Japanese Speech Generation net☆31Updated last month
- High quality text-to-speech based on StyleTTS 2.☆36Updated this week
- VoiceBox neural network implementation☆106Updated 8 months ago
- Official Code for ParrotTTS☆48Updated 6 months ago
- ☆27Updated 3 weeks ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated last month
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆70Updated this week
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- ☆40Updated 2 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆38Updated last week
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 10 months ago
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆38Updated 4 months ago
- ☆26Updated 5 months ago
- ☆50Updated 3 weeks ago
- GPT for FACodec☆13Updated last year
- Official repository of Wavehax vocoder☆46Updated 4 months ago
- ☆26Updated last year
- ☆35Updated last year
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆69Updated 6 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆48Updated this week
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆16Updated last year