viitor-ai / viitor-voiceLinks
An LLM base TTS engine
☆81Updated 6 months ago
Alternatives and similar repositories for viitor-voice
Users that are interested in viitor-voice are comparing it to the libraries listed below
Sorting:
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆78Updated last week
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆244Updated last week
- ☆40Updated 5 months ago
- ☆57Updated last year
- All generative model in one for better TTS model☆71Updated 10 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Updated last year
- Official Code for ParrotTTS☆52Updated 9 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆100Updated last month
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆93Updated last month
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆93Updated last year
- A trainer for SNAC (Multi-Scale Neural Audio Codec) has replaced the decoder with Vocos.☆55Updated 8 months ago
- ☆71Updated 2 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆73Updated 8 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆105Updated 3 months ago
- ☆33Updated 2 years ago
- ☆28Updated 5 months ago
- Chinese and English Bilinguish G2P☆21Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆32Updated 5 months ago
- ☆68Updated 10 months ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆74Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated last year
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆100Updated 6 months ago
- faster inference☆28Updated 5 months ago
- ☆65Updated last year
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆11Updated 6 months ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆38Updated 3 weeks ago
- a lightweight voice conversion☆84Updated 10 months ago
- ☆19Updated last year
- ☆29Updated last year
- Huawei Grad-TTS for Chinese☆50Updated last year