A cross platform implementation of Text-to-Speech based on ONNXRuntime.
☆32May 10, 2023Updated 2 years ago
Alternatives and similar repositories for RapidTTS
Users that are interested in RapidTTS are comparing it to the libraries listed below
Sorting:
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Oct 19, 2023Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated last year
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Apr 7, 2021Updated 4 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆17Apr 27, 2023Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- End-To-End SpeechSynthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- cpp inference for EmotiVoice☆16Jan 1, 2024Updated 2 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- 单独维护的中文TTS☆34Oct 28, 2022Updated 3 years ago
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- singing voice conversion without f0☆23May 10, 2023Updated 2 years ago
- g2p for english tts☆19Nov 10, 2022Updated 3 years ago
- ☆54Jul 16, 2025Updated 7 months ago
- ONNX deployment of the CREPE pitch tracker☆26Oct 27, 2022Updated 3 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- wav2lip-api☆11Mar 16, 2023Updated 2 years ago
- ☆15Jul 14, 2020Updated 5 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023☆30Jul 29, 2023Updated 2 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin…☆70Nov 21, 2022Updated 3 years ago
- ☆40Jul 15, 2025Updated 7 months ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- Reflex select component which allows the user to search for options and create new ones.☆13Nov 4, 2024Updated last year
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- ☆12Jul 27, 2022Updated 3 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- ☆14Aug 1, 2025Updated 6 months ago
- ☆11Mar 22, 2023Updated 2 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- Demo on iGPU for FFmpeg decode and scale, OpenVINO inference. this is zero-copy solution, which means No frame data copy from CPU to iGPU…☆17Jan 25, 2023Updated 3 years ago