OzymandiasTheGreat / libfvad-wasm

Voice activity detection (VAD) library based on WebRTC's VAD engine, built to WASM with Emscripten to run in browsers, Node, and NativeScript

☆31

Alternatives and similar repositories for libfvad-wasm

Users who are interested in libfvad-wasm are comparing it to the libraries listed below.

Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆222 Updated 2 weeks ago
linto-ai / WebVoiceSDK
Building blocks for voice-enabled applications in the browser
☆37 Updated 3 months ago
appvoid / vosper
Real-Time Whisper Voice Recognition with vosk model feedback.
☆117 Updated 2 years ago
Picovoice / web-voice-processor
A library for real-time voice processing in web browsers
☆226 Updated 5 months ago
jitsi / rnnoise-wasm
rnnoise noise suppression library as a WASM module
☆155 Updated 6 months ago
Snirpo / node-vad
Voice activity detection library for NodeJS
☆64 Updated 5 years ago
SEPIA-Framework / sepia-web-audio
Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…
☆47 Updated 2 years ago
nalbion / whisper-server
streaming speech to text server using Whisper
☆94 Updated 2 years ago
solyarisoftware / WeBAD
Web Browser Audio Detection/Speech Recording Events API
☆75 Updated 3 years ago
hedrergudene / asr-sd-pipeline
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
☆96 Updated last year
ccoreilly / vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
☆469 Updated last year
Picovoice / falcon
On-device speaker diarization powered by deep learning
☆52 Updated 3 weeks ago
SEPIA-Framework / sepia-stt-server
SEPIA server to support open-source speech recognition via WebSocket connection.
☆128 Updated 8 months ago
atyenoria / livekit-whisper-transcribe
☆26 Updated 2 years ago
geekodour / wscribe-editor
Web-based editor for subtitles and transcripts
☆137 Updated 11 months ago
meronym / speaker-diarization
Speaker diarization model
☆28 Updated 2 years ago
zhuzilin / whisper-openvino
openvino version of openai/whisper
☆170 Updated last year
Picovoice / koala
On-device noise suppression powered by deep learning
☆73 Updated 3 weeks ago
linto-ai / linto-stt
An automatic speech recognition API
☆66 Updated 2 weeks ago
luweigen / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆120 Updated last year
bigcash / awesome-vad
A curated list of awesome voice activity detection
☆59 Updated 8 months ago
mesolitica / vllm-whisper
A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper
☆28 Updated last year
latishab / turnsense
A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.
☆25 Updated 4 months ago
futo-org / whisper-acft
☆150 Updated last year
JonathanFly / faster-whisper-livestream-translator
faster-whisper livestream translation, OBS noise reduction, dual language subtitles
☆79 Updated 2 years ago
ReadAlongs / SoundSwallower
An even smaller speech recognizer / force aligner
☆35 Updated 7 months ago
litongjava / whisper-cpp-server
whisper-cpp-server: real-time speech recognition with OpenAI's Whisper model in C/C++
☆68 Updated last year
geekodour / wscribe
ez audio transcription tool with flexible processing and post-processing options
☆156 Updated last year
Wordcab / wordcab-transcribe
💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.
☆215 Updated 9 months ago
YuanGongND / whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …
☆400 Updated last year