IntendedConsequence / vadc
Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech
☆10Updated last month
Related projects ⓘ
Alternatives and complementary repositories for vadc
- ncnn HiFi-GAN☆24Updated last month
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting mixed English and Chinese languages.☆18Updated this week
- Port of Funasr's Paraformer model in C/C++☆25Updated 4 months ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- noise reduction☆17Updated 4 months ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆56Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆43Updated 2 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆14Updated this week
- A library for adding punctuation into a text from ASR.☆17Updated last year
- ☆16Updated 7 months ago
- Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]☆8Updated 4 months ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆60Updated 3 weeks ago
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆40Updated 2 years ago
- some ncnn demos of FunASR☆16Updated last month
- Running the F5-TTS by ONNX Runtime☆27Updated last week
- ☆10Updated last year
- Python bindings of speexdsp noise suppression library☆35Updated last year
- ☆10Updated last year
- This is an unofficial Pytorch implementation of the DTLN model repository, which contains denoising and inference code for the DTLN model…☆12Updated last year
- Voice Framework☆11Updated this week
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆24Updated last week
- ☆27Updated 3 years ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- Project of Singing Voice Conversion.☆14Updated last year
- This is a TTS model based on VITS that can control the output speech emotion through natural language and control the speaker through ref…☆4Updated 2 months ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 4 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆31Updated last year
- ☆30Updated 3 years ago
- ☆10Updated 2 months ago