Keith-Hon / vits-cantonese
Cantonese Text to Speech with VITS implementation
☆32Updated 2 years ago
Alternatives and similar repositories for vits-cantonese
Users who are interested in vits-cantonese are comparing it to the repositories listed below
- ☆58Updated last year
- ☆29Updated 6 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆101Updated 9 months ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (like fish/cosy/chattts).☆81Updated this week
- a lightweight voice conversion☆84Updated 11 months ago
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- Official Code for ParrotTTS☆53Updated 9 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆107Updated 2 months ago
- Official implementation of the TTS model Lina-Speech☆167Updated 6 months ago
- High quality text-to-speech based on StyleTTS 2.☆57Updated this week
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 2 months ago
- ☆29Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆83Updated 11 months ago
- noise reduction☆17Updated last year
- ☆50Updated 4 months ago
- Finetuning VITS Efficiently☆33Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆105Updated 4 months ago
- finetune llm part for spark-tts model☆102Updated 4 months ago
- ☆40Updated 2 months ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Updated 2 years ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆12Updated last week
- Barkify: an unofficial training implementation of Bark TTS by suno-ai☆127Updated 2 years ago
- ☆65Updated last month
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆96Updated 2 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆99Updated last year
- Unofficial implementation of wavenext vocoder☆48Updated 11 months ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆101Updated 7 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- CTC decoder with hotwords for ASR.☆20Updated 3 months ago
- Python Wrapper of Silero VAD☆57Updated 2 months ago