OzymandiasTheGreat / libfvad-wasmLinks
Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeScript
☆30Updated 11 months ago
Alternatives and similar repositories for libfvad-wasm
Users that are interested in libfvad-wasm are comparing it to the libraries listed below
Sorting:
- Buildings block for voice-enabled applications in the browser☆37Updated 2 months ago
- On-device voice activity detection (VAD) powered by deep learning☆218Updated this week
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- Speaker diarization model☆27Updated 2 years ago
- A curated list of awesome voice activity detection☆57Updated 7 months ago
- ONNX Inference of Pyannote Segmentation☆91Updated 6 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- Wasm Port of Recurrent neural network for audio noise reduction. Based on xiph/rnnoise C++ project☆42Updated 4 years ago
- rnnoise noise suppression library as a WASM module☆152Updated 4 months ago
- On-device speaker diarization powered by deep learning☆51Updated this week
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆21Updated 2 months ago
- An echo cancellation library for browsers using DTLN-aec☆26Updated last year
- Voice activation detection library for NodeJS☆57Updated 5 years ago
- An even smaller speech recognizer / force aligner☆33Updated 6 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆115Updated 2 years ago
- streaming speech to text server using Whisper☆93Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆95Updated last year
- IPA Phonemizer/Dephonemizer for 139 human languages☆27Updated 2 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆99Updated 8 months ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- RNNoise for WASM☆52Updated 4 years ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆28Updated 10 months ago
- On-device noise suppression powered by deep learning☆73Updated this week
- An automatic speech recognition API☆61Updated this week
- Putting flows on top of neural transducers for better TTS☆62Updated this week
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- ☆26Updated 2 years ago
- A fork of Lyra (version 1) that supports a webassembly build. See https://github.com/mayitayew/soundstream-wasm for a more recent version…☆25Updated 2 years ago