pengzhendong / streaming-tts-webui
Streaming Text to Speech Web UI
☆15Updated 9 months ago
Alternatives and similar repositories for streaming-tts-webui:
Users who are interested in streaming-tts-webui are comparing it to the repositories listed below:
- (WIP) Long-form speech generation☆30Updated 2 months ago
- noise reduction☆17Updated 7 months ago
- ☆18Updated 4 months ago
- CTC decoder with hotwords for ASR.☆16Updated last month
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 5 months ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated 7 months ago
- Separately maintained Chinese TTS☆35Updated 2 years ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆48Updated 7 months ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆28Updated 11 months ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- silero-vad PyTorch implementation☆16Updated 2 months ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆18Updated 6 months ago
- ☆16Updated 3 months ago
- Chinese and English Bilingual G2P☆20Updated last year
- Huawei Grad-TTS for Chinese☆46Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆15Updated last year
- Singing Voice Speech modeling test☆35Updated 2 years ago
- Text-To-Speech for NotebookLM☆29Updated 2 months ago
- A simple command line tool to calculate WER for ASR.☆14Updated 4 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆63Updated 3 months ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆32Updated 4 months ago
- G2P for English TTS☆18Updated 2 years ago
- Megatts2 using HierSpeechpp's vocoder☆17Updated 2 months ago
- ☆39Updated last year
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆16Updated last month
- A pitch detection model trained to be robust against noise and reverberation environments.☆23Updated last month
- ☆65Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆85Updated 2 weeks ago
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLMs☆37Updated this week