IntendedConsequence / vadc
Uses the excellent Silero VAD with the ONNX Runtime C API for fast detection of audio segments containing speech
☆13Updated 7 months ago
Alternatives and similar repositories for vadc
Users that are interested in vadc are comparing it to the libraries listed below
- Using OpenVINO to speed up MeloTTS inference☆11Updated 6 months ago
- Simple inference for VITS2 TTS using ONNX Runtime and espeak-ng in C++☆16Updated last year
- ncnn HiFi-GAN☆26Updated 7 months ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆13Updated 5 months ago
- Python bindings of speexdsp noise suppression library☆38Updated 2 years ago
- Project of Singing Voice Conversion.☆14Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆37Updated 5 months ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- noise reduction☆17Updated 10 months ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Port of FunASR's Paraformer model in C/C++☆31Updated 10 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆35Updated 3 weeks ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆14Updated 5 months ago
- MNN ASR demo☆16Updated last month
- Equal Loudness Filter☆10Updated 6 years ago
- ☆12Updated 2 years ago
- Voice Framework☆14Updated last month
- SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆73Updated 9 months ago
- ☆10Updated 6 months ago
- Supervoice Speaker Separation Network☆12Updated 11 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆57Updated last month
- speaker-disentangled speech linguistic content quantizer☆14Updated last month
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated 2 years ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- Voice activity detection and speaker gender segmentation audiovisual corpus☆13Updated 3 months ago
- ☆15Updated 9 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆12Updated 8 months ago
- text to speech☆10Updated last year
- Target speaker automatic speech recognition (TS-ASR)☆11Updated last year
- ☆13Updated 8 months ago