taresh18 / TTSizerLinks
šļø Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets āØ
ā90Updated last month
Alternatives and similar repositories for TTSizer
Users that are interested in TTSizer are comparing it to the libraries listed below
Sorting:
- An unofficial PyTorch implementation of VALL-Eā87Updated last month
- StyleTTS 2 Optimized Training Forkā32Updated 5 months ago
- High quality text-to-speech based on StyleTTS 2.ā52Updated this week
- Official repository of Wavehax vocoderā54Updated 7 months ago
- Open TTS models, built for streaming on the edgeā43Updated 3 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"ā100Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPā¦ā100Updated 9 months ago
- ā40Updated 5 months ago
- VoiceBox neural network implementationā109Updated 11 months ago
- Official Code for ParrotTTSā52Updated 9 months ago
- Google's SoundStorm: Efficient Parallel Audio Generationā132Updated last year
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).ā78Updated this week
- Audio tokenization, in the fastest way possible!ā52Updated 10 months ago
- VALL-E 2 reproductionā129Updated 11 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,ā¦ā74Updated 9 months ago
- Official implementation of the TTS model Lina-Speechā165Updated 6 months ago
- ā50Updated last week
- ā50Updated 3 months ago
- ā57Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.ā63Updated last month
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.ā41Updated last week
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesisā135Updated 6 months ago
- ā26Updated 8 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversionā92Updated last year
- A trainer for SNAC (Multi-Scale Neural Audio Codec) has replaced the decoder with Vocos.ā55Updated 8 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variabilityā102Updated 5 months ago
- GPT-style network for phonemization with durations of textā66Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.ā109Updated last month
- AudioSR-Upsampling (any -> 48kHz)ā41Updated last year
- Llasa Speed Upā35Updated last month