cirosilvano / easyvadLinks
Simple, energy-based voice activity detection algorithm implementation.
☆17Updated last year
Alternatives and similar repositories for easyvad
Users that are interested in easyvad are comparing it to the libraries listed below
Sorting:
- Port of Funasr's Paraformer model in C/C++☆34Updated last year
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build.☆30Updated 2 years ago
- ☆31Updated 3 years ago
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆21Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆229Updated 2 weeks ago
- Speech-end detection library, based on WebRTC's VAD engine☆26Updated 5 months ago
- Port of Meta's Encodec in C/C++☆222Updated 10 months ago
- A FreeSWITCH module to interface to your speech recognition server over websocket☆36Updated 3 months ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆42Updated last year
- Open models for Coqui STT☆144Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆63Updated 2 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆82Updated 2 weeks ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Open source cross-platform implementation of MRCP protocol☆20Updated 3 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆113Updated 2 years ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- A java wrapper around the WebRTC Voice Activity Detection library☆66Updated 4 years ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆49Updated 6 months ago
- Read-only unofficial mirror of OpenFst☆44Updated 3 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆131Updated 11 months ago
- Google Chrome Text to Speech command line client☆34Updated 4 years ago
- Coqui Inference Engine☆41Updated 4 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- ☆22Updated 4 years ago
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- Whisper.cpp Speech-to-text with Voice Acticity Detection☆19Updated 11 months ago
- ONNX Inference of Pyannote Segmentation☆93Updated 9 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- A curated list of awesome voice activity detection☆66Updated 10 months ago