cirosilvano / easyvadLinks
Simple, energy-based voice activity detection algorithm implementation.
☆18Updated last year
Alternatives and similar repositories for easyvad
Users that are interested in easyvad are comparing it to the libraries listed below
Sorting:
- Port of Funasr's Paraformer model in C/C++☆39Updated last year
- ☆32Updated 3 years ago
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build.☆30Updated 3 years ago
- A FreeSWITCH module to interface to your speech recognition server over websocket☆37Updated 7 months ago
- Speech-end detection library, based on WebRTC's VAD engine☆26Updated 8 months ago
- An example of using pion/opus in WASM to decode audio files - https://sean-der.github.io/wasm-audio-decode/☆15Updated 3 years ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆43Updated last year
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆20Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆63Updated 2 years ago
- Port of Meta's Encodec in C/C++☆227Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆243Updated last week
- A java wrapper around the WebRTC Voice Activity Detection library☆66Updated 4 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆33Updated 2 years ago
- Open models for Coqui STT☆150Updated 2 years ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile softw…☆30Updated 3 years ago
- Audio Loudness Normalization Filter Port From FFmpeg☆12Updated 6 years ago
- Open source cross-platform implementation of MRCP protocol☆20Updated 3 years ago
- Whisper.cpp Speech-to-text with Voice Acticity Detection☆20Updated last year
- a implementation of Shazam algorithm☆49Updated 6 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆124Updated 2 years ago
- Coqui Inference Engine☆40Updated 4 years ago
- An echo cancellation library for browsers using DTLN-aec☆26Updated 2 years ago
- Python Audio Separator in Real Time using MDX-NET model☆24Updated 2 years ago
- openvino version of openai/whisper☆180Updated 2 years ago
- freeswitch百度语音识别模块☆25Updated 4 years ago
- Read-only unofficial mirror of OpenFst☆44Updated 3 years ago
- A ggml (C++) re-implementation of tortoise-tts☆193Updated last year
- YapHash is a perceptual fingerprint for audio identification purposes. This is the standalone version of the VIAT featureX☆40Updated 12 years ago
- ONNX Inference of Pyannote Segmentation☆97Updated last year