cirosilvano / easyvad
Simple, energy-based voice activity detection algorithm implementation.
☆17Updated last year
Alternatives and similar repositories for easyvad
Users who are interested in easyvad are comparing it to the libraries listed below.
- On-device voice activity detection (VAD) powered by deep learning☆220Updated last week
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build.☆29Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆32Updated last year
- A C++ ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆42Updated 10 months ago
- ☆30Updated 3 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- Tunable pipelines☆34Updated 4 months ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆30Updated this week
- Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. For phonemization an ONNX runtime port…☆22Updated this week
- ☆22Updated 4 years ago
- Port of Meta's Encodec in C/C++☆226Updated 7 months ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆67Updated 2 weeks ago
- Coqui Inference Engine☆40Updated 3 years ago
- A java wrapper around the WebRTC Voice Activity Detection library☆61Updated 4 years ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆105Updated 2 years ago
- phonetic similarity algorithms☆13Updated 7 years ago
- openvino version of openai/whisper☆168Updated last year
- A repository for trainable multi-speaker TTS☆14Updated 3 years ago
- Open models for Coqui STT☆143Updated 2 years ago
- A fork of Lyra (version 1) that supports a webassembly build. See https://github.com/mayitayew/soundstream-wasm for a more recent version…☆25Updated 3 years ago
- ONNX Inference of Pyannote Segmentation☆92Updated 6 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Updated last year
- Whisper.cpp Speech-to-text with Voice Activity Detection☆19Updated 8 months ago
- C++ version of pyannote audio speaker diarization pipeline☆21Updated last year
- An example of using pion/opus in WASM to decode audio files - https://sean-der.github.io/wasm-audio-decode/☆15Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago