ishine / LangSegment
A tool for automatically recognizing and segmenting mixed multilingual text (97 languages) for TTS.
☆13Updated last year
Alternatives and similar repositories for LangSegment
Users that are interested in LangSegment are comparing it to the libraries listed below
- Chinese and English Bilingual G2P☆21Updated last year
- Improves the accuracy of pypinyin using g2pW☆89Updated last year
- Predict prosody labels for Chinese sentences.☆41Updated 2 years ago
- Forced Alignment-MFA☆39Updated 2 years ago
- Chinese Text Normalization and Dataset☆83Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆97Updated last year
- ☆64Updated last year
- The Implementation of FastSpeech2 Based on Pytorch.☆52Updated last year
- Huawei Grad-TTS for Chinese☆50Updated last year
- The code for aishell-3 baseline acoustic model☆67Updated 4 years ago
- ☆75Updated 3 years ago
- The implementation of g2pL with a new open dataset.☆16Updated last year
- Paper reproduction: Chinese polyphone disambiguation using POS tags☆21Updated 5 years ago
- ☆33Updated 2 months ago
- An efficient Chinese TTS dataset built on a linguistic ontology, comprehensively covering Chinese polyphonic characters, tone sandhi, and related phenomena.☆27Updated 9 months ago
- Implementation of StyleTTS for Mandarin☆11Updated last year
- ☆33Updated 3 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆25Updated 4 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated last year
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆85Updated 2 years ago
- Singing Voice Speech modeling test☆35Updated 2 years ago
- ☆56Updated last year
- Chinese prosody prediction model based on random forests and conditional random fields☆28Updated 9 months ago
- TTS models based on VITS, FastSpeech2, and VISinger☆23Updated 2 years ago
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆28Updated last month
- PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis☆72Updated 3 years ago
- ☆40Updated 8 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆96Updated last month
- TransferTTS (Zero-Shot learning of VITS)☆95Updated 2 years ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆49Updated 9 months ago