gumblex / whisper_vadLinks
Whisper.cpp Speech-to-text with Voice Acticity Detection
☆19Updated 8 months ago
Alternatives and similar repositories for whisper_vad
Users that are interested in whisper_vad are comparing it to the libraries listed below
Sorting:
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆42Updated 10 months ago
- Port of Funasr's Paraformer model in C/C++☆32Updated last year
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆15Updated 9 months ago
- XCORE-VOICE Solution☆15Updated last month
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- NNSE (Neural Network Speech Enhancement) is a speech-denoiser optimized to run on Ambiq's low power platform☆39Updated 2 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆67Updated last week
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- ☆11Updated 3 years ago
- Open models for Coqui STT☆141Updated 2 years ago
- ncnn HiFi-GAN☆26Updated 9 months ago
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only☆22Updated 11 months ago
- Python bindings of speexdsp noise suppression library☆39Updated 2 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- C code to extract mfcc or fbank features from wav files☆16Updated 5 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆103Updated 2 years ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆48Updated 4 months ago
- TTS inference in C++ based on TFlite model☆18Updated 4 years ago
- Inference TinyLlama models on ncnn☆24Updated last year
- noise reduction☆17Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆79Updated 11 months ago
- The CPP version of Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆21Updated last year
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆35Updated 3 months ago
- Using OpenVINO to speed up MeloTTS inference☆12Updated 8 months ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆13Updated 2 years ago
- Project of Singing Voice Conversion.☆15Updated last year
- An implementation of MeloTTS by onnxruntime☆23Updated 8 months ago
- ☆59Updated 2 weeks ago
- Experiments with BitNet inference on CPU☆54Updated last year