maxilevi / vits.cppLinks
a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile devices. this is my undergraduate project
β43Updated last year
Alternatives and similar repositories for vits.cpp
Users that are interested in vits.cpp are comparing it to the libraries listed below
Sorting:
- Port of Meta's Encodec in C/C++β227Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.ποΈπ»β62Updated 2 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.β93Updated 4 months ago
- Port of Funasr's Paraformer model in C/C++β39Updated last year
- C++ library for converting text to phonemes for Piperβ139Updated 6 months ago
- TTS support with GGMLβ218Updated 4 months ago
- A ggml (C++) re-implementation of tortoise-ttsβ193Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models.β27Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++β18Updated last year
- β67Updated 6 months ago
- zero-shot realtime TTS system, fully offline, free and open sourceβ50Updated 9 months ago
- ONNX Inference of Pyannote Segmentationβ97Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denβ¦β109Updated last year
- An onnx-exportable implementation of iSTFT in torchβ32Updated 11 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime portβ¦β29Updated 5 months ago
- Running the F5-TTS by ONNX Runtimeβ191Updated last month
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigenβ53Updated 10 months ago
- Using OpenVINO to speed up MeloTTS inferenceβ15Updated last year
- Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.β19Updated last month
- Onnx compatible styletts2 codeβ17Updated 7 months ago
- β21Updated 9 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β106Updated last year
- C++ version of pyannote audio speaker diarizaiton pipelineβ22Updated last year
- StyleTTS 2 Optimized Training Forkβ33Updated last year
- Experiments to test different speech recognition systems for SEPIA Frameworkβ63Updated 2 years ago
- β55Updated 3 weeks ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the β¦β54Updated last year
- ONNX and TensorRT implementation of Whisperβ66Updated 2 years ago
- ncnn HiFi-GANβ29Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.β149Updated last week