viitor-ai / viitor-voice
An LLM-based TTS engine
☆79Updated 4 months ago
Alternatives and similar repositories for viitor-voice
Users who are interested in viitor-voice are comparing it to the libraries listed below.
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆95Updated 4 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆96Updated last month
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆49Updated 10 months ago
- ☆57Updated 10 months ago
- All generative models in one for a better TTS model☆71Updated 8 months ago
- ☆40Updated 3 months ago
- ☆26Updated 3 months ago
- Official Code for ParrotTTS☆50Updated 7 months ago
- Huawei Grad-TTS for Chinese☆50Updated last year
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (such as fish/cosy/chattts).☆74Updated last week
- ☆65Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆90Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆68Updated 6 months ago
- The open source code for SimpleSpeech series☆138Updated 7 months ago
- ☆71Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆69Updated last year
- ☆68Updated 8 months ago
- ☆29Updated last year
- ☆18Updated last year
- ☆19Updated 6 months ago
- ☆50Updated last month
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆134Updated 4 months ago
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆58Updated last month
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆97Updated last year
- High quality text-to-speech based on StyleTTS 2.☆42Updated this week
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆101Updated 4 months ago
- A trainer for SNAC (Multi-Scale Neural Audio Codec) with the decoder replaced by Vocos.☆52Updated 6 months ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆118Updated last month
- faster inference☆28Updated 3 months ago