seanghay / vits.cpp
VITS Inference using ONNX Runtime on C++
☆13Updated 2 years ago
Alternatives and similar repositories for vits.cpp
Users that are interested in vits.cpp are comparing it to the libraries listed below
- ☆29Updated last year
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆62Updated 5 months ago
- PyTorch implementation of silero-vad☆34Updated last year
- Megatts2 using HierSpeechpp's vocoder☆18Updated last year
- WeNet online decode demo☆31Updated 4 years ago
- Colab notebooks for Next-gen Kaldi☆29Updated 3 months ago
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 6 months ago
- Unofficial implementation of wavenext vocoder☆56Updated last year
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆79Updated 3 years ago
- CTC decoder with hotwords for ASR.☆34Updated 9 months ago
- ☆82Updated last year
- Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation☆134Updated 2 weeks ago
- Training code and dataset cleansing with Sidon☆75Updated 3 weeks ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆94Updated 10 months ago
- Repository for the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" (INTERSPEECH 2024)☆34Updated 3 weeks ago
- Python Wrapper of Silero VAD☆64Updated 9 months ago
- PyTorch model for contextless phoneme prediction from speech audio☆30Updated 3 months ago
- Clustering-based methods for overlapping diarization☆82Updated 2 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆83Updated 7 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Updated last year
- Implementation of TTS model based on NVIDIA P-Flow TTS Paper☆77Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆78Updated last year
- High quality text-to-speech based on StyleTTS 2.☆71Updated last month
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆17Updated last year
- All generative models in one for a better TTS model☆74Updated last year
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆59Updated last year
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆79Updated 7 months ago
- a lightweight voice conversion☆86Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆43Updated 3 years ago