gumblex / whisper_vadLinks
Whisper.cpp Speech-to-text with Voice Acticity Detection
☆16Updated 7 months ago
Alternatives and similar repositories for whisper_vad
Users that are interested in whisper_vad are comparing it to the libraries listed below
Sorting:
- Inference TinyLlama models on ncnn☆24Updated last year
- Port of Funasr's Paraformer model in C/C++☆31Updated 11 months ago
- ez audio transcription tool with flexible processing and post-processing options☆151Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 8 months ago
- Python bindings of speexdsp noise suppression library☆38Updated 2 years ago
- ☆42Updated last week
- ☆11Updated 3 years ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆109Updated 2 weeks ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆14Updated 6 months ago
- NNSE (Neural Network Speech Enhancement) is a speech-denoiser optimized to run on Ambiq's low power platform☆39Updated 3 weeks ago
- Using OpenVINO to speed up MeloTTS inference☆11Updated 7 months ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆40Updated 9 months ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆47Updated 2 months ago
- ☆16Updated last year
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆33Updated last year
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆75Updated 9 months ago
- A fast MP3 decoder for python, using minimp3☆28Updated 2 years ago
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆14Updated 8 months ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆24Updated 2 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆100Updated 2 years ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆13Updated 2 years ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆27Updated last month
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 7 months ago
- openvino version of openai/whisper☆13Updated 7 months ago
- ONNX and TensorRT implementation of Whisper☆63Updated 2 years ago
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago