cirosilvano / easyvadLinks
Simple, energy-based voice activity detection algorithm implementation.
☆17Updated last year
Alternatives and similar repositories for easyvad
Users that are interested in easyvad are comparing it to the libraries listed below
Sorting:
- Port of Funasr's Paraformer model in C/C++☆35Updated last year
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build.☆30Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆228Updated last month
- Port of Meta's Encodec in C/C++☆226Updated 9 months ago
- Open models for Coqui STT☆142Updated 2 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Read-only unofficial mirror of OpenFst☆44Updated 3 years ago
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆21Updated 2 years ago
- Google Chrome Text to Speech command line client☆34Updated 4 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆112Updated 2 years ago
- A FreeSWITCH module to interface to your speech recognition server over websocket☆36Updated 2 months ago
- Speech-end detection library, based on WebRTC's VAD engine☆26Updated 4 months ago
- Open source cross-platform implementation of MRCP protocol☆20Updated 3 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- ☆31Updated 3 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- openvino version of openai/whisper☆174Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Updated last year
- An example of using pion/opus in WASM to decode audio files - https://sean-der.github.io/wasm-audio-decode/☆15Updated 2 years ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- Whisper.cpp Speech-to-text with Voice Acticity Detection☆19Updated 10 months ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆47Updated 6 months ago
- ONNX Inference of Pyannote Segmentation☆93Updated 8 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆77Updated last month
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆31Updated last year
- A java wrapper around the WebRTC Voice Activity Detection library☆65Updated 4 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆31Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year