OzymandiasTheGreat / libfvad-wasm
Voice activity detection (VAD) library based on WebRTC's VAD engine, built to WASM with Emscripten to run in browsers, Node, and NativeScript
☆31Updated last year
Alternatives and similar repositories for libfvad-wasm
Users that are interested in libfvad-wasm are comparing it to the libraries listed below
- On-device voice activity detection (VAD) powered by deep learning☆222Updated 2 weeks ago
- Building blocks for voice-enabled applications in the browser☆37Updated 3 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆117Updated 2 years ago
- A library for real-time voice processing in web browsers☆226Updated 5 months ago
- rnnoise noise suppression library as a WASM module☆155Updated 6 months ago
- Voice activation detection library for NodeJS☆64Updated 5 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- streaming speech to text server using Whisper☆94Updated 2 years ago
- Web Browser Audio Detection/Speech Recording Events API☆75Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆469Updated last year
- On-device speaker diarization powered by deep learning☆52Updated 3 weeks ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 8 months ago
- ☆26Updated 2 years ago
- web based editor for subtitles and transcripts☆137Updated 11 months ago
- Speaker diarization model☆28Updated 2 years ago
- openvino version of openai/whisper☆170Updated last year
- On-device noise suppression powered by deep learning☆73Updated 3 weeks ago
- An automatic speech recognition API☆66Updated 2 weeks ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆120Updated last year
- A curated list of awesome voice activity detection☆59Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆28Updated last year
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆25Updated 4 months ago
- ☆150Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆79Updated 2 years ago
- An even smaller speech recognizer / force aligner☆35Updated 7 months ago
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆68Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆156Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆215Updated 9 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆400Updated last year