pengzhendong / wetextLinks
Python runtime for WeTextProcessing (does not depend on Pynini)
☆46Updated last month
Alternatives and similar repositories for wetext
Users that are interested in wetext are comparing it to the libraries listed below
Sorting:
- ☆23Updated last year
- faster inference☆28Updated last year
- Streaming Text to Speech Web UI☆22Updated last year
- CTC decoder with hotwords for ASR.☆34Updated 9 months ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆63Updated 4 months ago
- (WIP)long form speech generatoins☆31Updated 9 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Updated last month
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Updated last year
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Updated last year
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆55Updated 4 months ago
- ☆29Updated 11 months ago
- Chinese and English Bilinguish G2P☆22Updated 2 years ago
- Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems☆68Updated 3 months ago
- ☆22Updated 5 months ago
- Megatts2 use HierSpeechpp's vocoder☆18Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Updated 2 years ago
- noise reduction☆17Updated last year
- ☆33Updated 2 years ago
- ☆45Updated 5 years ago
- ☆23Updated last year
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Updated last year
- Chinese polyphone disambiguation for Text-to-Speech application☆41Updated last year
- CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO Fine-Tuning!☆112Updated 5 months ago
- ☆36Updated 4 months ago
- Huawei Grad-TTS for Chinese☆51Updated 2 years ago
- ☆68Updated 2 years ago
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆49Updated 4 months ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated 2 years ago
- Open Source Speech/Text Data on AI☆19Updated 3 years ago
- Decoders from Kaldi using OpenFst☆34Updated 5 months ago