gumblex / whisper_vadLinks
Whisper.cpp Speech-to-text with Voice Acticity Detection
☆20Updated last year
Alternatives and similar repositories for whisper_vad
Users that are interested in whisper_vad are comparing it to the libraries listed below
Sorting:
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆16Updated last year
- openvino version of openai/whisper☆15Updated last year
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆53Updated 9 months ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆62Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆39Updated last year
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆13Updated 3 years ago
- The CPP version of Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆23Updated last year
- Detecting segments belonging to which song in database, and return Nil if does not exist in a database.☆22Updated 4 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆33Updated 2 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆89Updated 3 months ago
- C++ library for converting text to phonemes for Piper☆137Updated 5 months ago
- A fast MP3 decoder for python, using minimp3☆29Updated 3 years ago
- ONNX implementation of Whisper. PyTorch free.☆102Updated last year
- ncnn HiFi-GAN☆29Updated last year
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆43Updated last year
- Port of Meta's Encodec in C/C++☆227Updated last year
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆138Updated 3 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆40Updated 10 months ago
- On-device noise suppression powered by deep learning☆78Updated last week
- Onnx compatible styletts2 code☆16Updated 7 months ago
- IPA Phonemizer/Dephonemizer for 144 human languages☆50Updated last week
- Python Audio Separator in Real Time using MDX-NET model☆24Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Updated last year
- Uses machine learning to denoise audio containing speech☆48Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆48Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆160Updated last year
- A lightweight wrapper around https://github.com/facebookresearch/encodec that enables dynamic streamed reading, seeking, metadata and GPU…☆15Updated last year