maxilevi / vits.cpp
a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile devices. this is my undergraduate project
☆29Updated 3 weeks ago
Related projects: ⓘ
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆10Updated 3 weeks ago
- Pytorch implementation of SoundCTM☆68Updated 3 weeks ago
- Unofficial implementation of wavenext vocoder☆28Updated 3 weeks ago
- Multispeaker Community Vocoder Model for DiffSinger☆34Updated 4 months ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆31Updated last year
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".☆45Updated last month
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 7 months ago
- Zero-Shot Emotion Style Transfer☆33Updated 5 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆74Updated 2 months ago
- Lyra V2 (SoundStream) running in the browser☆17Updated last year
- ONNX deployment of the CREPE pitch tracker☆20Updated last year
- FlashSpeech: Efficient Zero-Shot Speech Synthesis☆64Updated last month
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆34Updated last month
- VALL-E 2 reproduction☆72Updated 2 months ago
- ☆31Updated last month
- ☆12Updated this week
- A lightweight wrapper around https://github.com/facebookresearch/encodec that enables dynamic streamed reading, seeking, metadata and GPU…☆11Updated 4 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆20Updated 4 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆18Updated 2 months ago
- ☆19Updated this week
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆18Updated 4 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆28Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR☆12Updated 7 months ago
- Non Parallel Voice Conversion based on VITS☆23Updated last year
- Fine-Tune Whisper with Transformers and PEFT☆26Updated 10 months ago
- ☆27Updated 10 months ago
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆104Updated last month
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances …☆30Updated last week
- a lightweight voice conversion☆78Updated 2 weeks ago
- ☆33Updated 5 months ago