wangzhaode / mnn-tts
mnn tts demo.
β11Updated this week
Alternatives and similar repositories for mnn-tts:
Users that are interested in mnn-tts are comparing it to the libraries listed below
- mnn asr demo.β13Updated this week
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.ποΈπ»β60Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++β16Updated 11 months ago
- ncnn HiFi-GANβ26Updated 5 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed dataβ10Updated 2 weeks ago
- StyleTTS 2 Optimized Training Forkβ26Updated last month
- A Tiny Project For ASR model training and Deploymentβ27Updated 2 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β12Updated last month
- β20Updated 5 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightningβ15Updated 5 months ago
- Project of Singing Voice Conversion.β14Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models.β24Updated 11 months ago
- Utilizes ONNX Runtime to transcribe audio into text.β18Updated last month
- 4G GPU & 10 Minutes for trainβ12Updated last year
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.β48Updated last week
- β12Updated 2 years ago
- VI-SVC model is just VITS without MAS and DurationPredictor.β10Updated last year
- β13Updated 7 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementβ12Updated 6 months ago
- εη¬η»΄ζ€ηδΈζTTSβ35Updated 2 years ago
- text to speechβ10Updated last year
- silero-vad pytorch implementβ16Updated 4 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).β22Updated this week
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into oneβ27Updated 7 months ago
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme levelβ¦β13Updated 3 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcriptsβ14Updated 3 months ago
- CTC decoder with hotwords for ASR.β17Updated 2 months ago
- Forced alignment decoder for Whisper.β14Updated last year
- Supervoice Speaker Separation Networkβ12Updated 10 months ago
- Pythonηι³ι’ε·₯ε ·β12Updated 4 months ago