cirosilvano / easyvad
Simple, energy-based voice activity detection algorithm implementation.
☆17Updated last year
Alternatives and similar repositories for easyvad:
Users that are interested in easyvad are comparing it to the libraries listed below
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆21Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- Port of Funasr's Paraformer model in C/C++☆32Updated 10 months ago
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build.☆27Updated 2 years ago
- ☆29Updated 3 years ago
- A SQLite extension for working with float and binary vectors. Work in progress!☆20Updated 2 years ago
- A fork of Lyra (version 1) that supports a webassembly build. See https://github.com/mayitayew/soundstream-wasm for a more recent version…☆25Updated 2 years ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆39Updated 8 months ago
- Read-only unofficial mirror of OpenFst☆44Updated 2 years ago
- Speech-end detection library, based on WebRTC's VAD engine☆21Updated 10 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- numpy ufuncs for vector similarity☆14Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated last year
- ☆13Updated 2 years ago
- Speech-to-text transcription VST3/ARA plugin☆36Updated this week
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆12Updated 7 months ago
- Coqui Inference Engine☆38Updated 3 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference☆52Updated last year
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆52Updated last week
- ☆22Updated 3 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆96Updated 2 years ago
- ☆24Updated 2 years ago
- Proof of concept for running moshi/hibiki using webrtc☆18Updated last month
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- Real Time (WebRTC & WebTransport) Proxy for LLM WebSocket APIs☆29Updated 3 months ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆29Updated 9 months ago
- A simple node.js MRCP (v.2) library☆11Updated 6 months ago
- Rust crate for some audio utilities☆22Updated last month
- Port of Meta's Encodec in C/C++☆218Updated 4 months ago