cirosilvano / easyvad
Simple, energy-based voice activity detection algorithm implementation.
☆17Updated last year
Alternatives and similar repositories for easyvad
Users who are interested in easyvad are comparing it to the libraries listed below.
- Open source cross-platform implementation of MRCP protocol☆20Updated 3 years ago
- Port of Funasr's Paraformer model in C/C++☆33Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆21Updated 2 years ago
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build.☆29Updated 2 years ago
- A FreeSWITCH module to interface to your speech recognition server over websocket☆36Updated last month
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Speech-end detection library, based on WebRTC's VAD engine☆26Updated 3 months ago
- Open models for Coqui STT☆141Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆57Updated last year
- ☆30Updated 3 years ago
- OpenVINO version of openai/whisper☆170Updated last year
- ☆22Updated 4 years ago
- Port of Meta's Encodec in C/C++☆226Updated 8 months ago
- On-device voice activity detection (VAD) powered by deep learning☆223Updated this week
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆70Updated this week
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆107Updated 2 years ago
- A java wrapper around the WebRTC Voice Activity Detection library☆61Updated 4 years ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆118Updated 2 years ago
- FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile softw…☆29Updated 3 years ago
- An example of using pion/opus in WASM to decode audio files - https://sean-der.github.io/wasm-audio-decode/☆15Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 9 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆23Updated last week
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆29Updated last year
- C++ version of pyannote audio speaker diarization pipeline☆21Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- Whisper.cpp Speech-to-text with Voice Activity Detection☆19Updated 9 months ago
- Coqui Inference Engine☆41Updated 4 years ago