cirosilvano / easyvad
Simple, energy-based voice activity detection algorithm implementation.
☆17Updated last year
Alternatives and similar repositories for easyvad
Users that are interested in easyvad are comparing it to the libraries listed below
Sorting:
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆21Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build.☆28Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆31Updated 10 months ago
- Coqui Inference Engine☆40Updated 3 years ago
- An example of using pion/opus in WASM to decode audio files - https://sean-der.github.io/wasm-audio-decode/☆15Updated 2 years ago
- An echo cancellation library for browsers using DTLN-aec☆26Updated last year
- Speech-end detection library, based on WebRTC's VAD engine☆22Updated last week
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆40Updated 8 months ago
- A java wrapper around the WebRTC Voice Activity Detection library☆61Updated 3 years ago
- ☆22Updated 3 years ago
- A fork of Lyra (version 1) that supports a webassembly build. See https://github.com/mayitayew/soundstream-wasm for a more recent version…☆25Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆17Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference☆53Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated 2 years ago
- phonetic similarity algorithms☆13Updated 6 years ago
- TTS support with GGML☆35Updated this week
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Using OpenVINO to speed up MeloTTS inference☆11Updated 6 months ago
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retriev…☆45Updated last year
- ☆30Updated 3 years ago
- Speech-to-text transcription VST3/ARA plugin☆41Updated this week
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆19Updated 8 months ago
- On-device voice activity detection (VAD) powered by deep learning☆214Updated last week
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆65Updated last year
- RNNoise for WASM☆52Updated 4 years ago
- Unofficial C binding for Onnxruntime in Golang.☆18Updated 3 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆24Updated last month
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year