gumblex / whisper_vad
Whisper.cpp Speech-to-text with Voice Acticity Detection
☆16Updated 5 months ago
Alternatives and similar repositories for whisper_vad:
Users that are interested in whisper_vad are comparing it to the libraries listed below
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only☆20Updated 8 months ago
- Port of Funasr's Paraformer model in C/C++☆30Updated 9 months ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆39Updated 7 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆12Updated 6 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆18Updated 5 months ago
- Speech-end detection library, based on WebRTC's VAD engine☆21Updated 9 months ago
- Inference TinyLlama models on ncnn☆24Updated last year
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated last year
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆50Updated this week
- ☆43Updated this week
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆13Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆59Updated last year
- Python bindings of speexdsp noise suppression library☆38Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options☆148Updated last year
- TTS inference in C++ based on TFlite model☆18Updated 4 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆88Updated last year
- 用rtaudio来采集、播放,并用speexdsp来做回声消除。☆18Updated 6 years ago
- A chat UI for Llama.cpp☆12Updated this week
- C++ version of pyannote audio speaker diarizaiton pipeline☆20Updated last year
- Web browser version of StarCoder.cpp☆44Updated last year
- Native C and C++ implementation of RAISR (Rapid and Accurate Image Super-Resolution). Intel Video Super Resolution Library☆71Updated 3 weeks ago
- A library for computing spectrograms and periodograms☆8Updated 6 years ago
- Inference RWKV with multiple supported backends.☆39Updated this week
- Realtime Style transfer using WebRTC (pion), ffmpeg and Tensorflow.☆24Updated 3 years ago
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆61Updated 10 months ago
- an improved version of Real-time-voice-cloning☆48Updated last year
- ☆11Updated 3 years ago
- Experiments with BitNet inference on CPU☆53Updated last year
- Speaker diarization service☆21Updated last month