zhaohb / MeloTTS-OVLinks
Using OpenVINO to speed up MeloTTS inference
☆12Updated 8 months ago
Alternatives and similar repositories for MeloTTS-OV
Users that are interested in MeloTTS-OV are comparing it to the libraries listed below
Sorting:
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Updated last year
- mnn tts demo.☆17Updated 2 months ago
- mnn asr demo.☆22Updated 3 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆19Updated last month
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 3 months ago
- Project of Singing Voice Conversion.☆15Updated last year
- ☆28Updated 5 months ago
- ☆12Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆32Updated 5 months ago
- Cantonese Text to Speech with VITS implementation☆31Updated 2 years ago
- ☆13Updated 10 months ago
- A collection of all our phonemeizers for dataset construction and inference☆24Updated 4 months ago
- An Open-source Streaming High-fidelity Neural Audio Codec☆11Updated last year
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆13Updated 7 months ago
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆15Updated 9 months ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- ☆19Updated last year
- The Vokan Architecture (Tsukasa speech based)☆10Updated 5 months ago
- text to speech☆10Updated last year
- Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]☆8Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆27Updated 2 months ago
- g2p for english tts☆19Updated 2 years ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 3 months ago
- noise reduction☆17Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- ☆11Updated last year
- MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making config.json + Training, Inference) ONE-CLICK☆13Updated last year
- ☆22Updated 9 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆23Updated this week