CanCLID / ToJyutping
粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool
☆76Updated last year
Alternatives and similar repositories for ToJyutping
Users that are interested in ToJyutping are comparing it to the libraries listed below
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.☆50Updated 5 months ago
- TTS models based on VITS, FastSpeech2, and VISinger☆24Updated 2 years ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆68Updated 2 years ago
- Monotonic Alignment Search☆96Updated 4 months ago
- Improves the accuracy of pypinyin using g2pW☆101Updated 2 years ago
- Overrides the built-in pinyin data in pypinyin with the pinyin data files from pinyin-data and phrase-pinyin-data☆63Updated 8 months ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (such as fish/cosy/chattts).☆85Updated this week
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 4 years ago
- "SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts"☆76Updated 2 years ago
- A method that directly addresses the modality gap by aligning speech token with the corresponding text transcription during the tokenizat…☆92Updated last month
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆226Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆52Updated last year
- ☆55Updated 3 years ago
- ONNX deployment of the CREPE pitch tracker☆23Updated 2 years ago
- ☆92Updated last year
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆105Updated 9 months ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Updated 3 years ago
- Cantonese TTS frontend☆16Updated 5 years ago
- An open-source singing voice synthesis system based on FastSpeech☆57Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆102Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆109Updated 6 months ago
- Extract phoneme-level timestamps from speech audio.☆79Updated last week
- Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated 2 years ago
- ☆66Updated 2 years ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆150Updated last year
- Singing Voice Synthesis based on VITS, different from VISinger☆190Updated last year
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆53Updated 2 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆105Updated last year
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆57Updated 2 years ago
- Code for DeSTA2.5-Audio☆115Updated 2 months ago