OzymandiasTheGreat / libfvad-wasmLinks
Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeScript
☆31Updated last year
Alternatives and similar repositories for libfvad-wasm
Users that are interested in libfvad-wasm are comparing it to the libraries listed below
Sorting:
- On-device voice activity detection (VAD) powered by deep learning☆227Updated 2 weeks ago
- Buildings block for voice-enabled applications in the browser☆37Updated 4 months ago
- rnnoise noise suppression library as a WASM module☆155Updated 6 months ago
- openvino version of openai/whisper☆174Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆118Updated 2 years ago
- A library for real-time voice processing in web browsers☆229Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆30Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- An echo cancellation library for browsers using DTLN-aec☆26Updated last year
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆27Updated 4 months ago
- An even smaller speech recognizer / force aligner☆35Updated 8 months ago
- Web Browser Audio Detection/Speech Recording Events API☆75Updated 3 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- web based editor for subtitles and transcripts☆140Updated last year
- Speaker diarization model☆28Updated 2 years ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆478Updated last year
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆69Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆129Updated 9 months ago
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- streaming speech to text server using Whisper☆94Updated 2 years ago
- On-device speech-to-text engine powered by deep learning☆459Updated this week
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆49Updated last year
- ☆27Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- generate granular word-level captions in srt format☆57Updated 2 years ago
- On-device noise suppression powered by deep learning☆75Updated 3 weeks ago
- A curated list of awesome voice activity detection☆62Updated 9 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆79Updated 2 years ago