linto-ai / WebVoiceSDK
Buildings block for voice-enabled applications in the browser
β36Updated last month
Alternatives and similar repositories for WebVoiceSDK:
Users that are interested in WebVoiceSDK are comparing it to the libraries listed below
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeScβ¦β28Updated 7 months ago
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- GUI Tool to create, manage and test Keyword Spotting models using TF 2.0β12Updated 4 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, reβ¦β47Updated last year
- Web app for keyword spotting using TensorflowJSβ70Updated 2 years ago
- An even smaller speech recognizer / force alignerβ32Updated 2 months ago
- Smart Whisper is a native Node.js addon designed for efficient and streamlined interaction with the whisper.cpp, with automatic model offβ¦β46Updated this week
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voiceβ10Updated 4 years ago
- On-device voice activity detection (VAD) powered by deep learningβ201Updated last week
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.β47Updated 7 months ago
- Wasm Port of Recurrent neural network for audio noise reduction. Based on xiph/rnnoise C++ projectβ42Updated 4 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.β112Updated last year
- Speaker diarization serviceβ21Updated 3 weeks ago
- β16Updated last year
- web based editor for subtitles and transcriptsβ123Updated 6 months ago
- Web Browser Audio Detection/Speech Recording Events APIβ73Updated 2 years ago
- β36Updated 11 months ago
- Speaker Diarization with Transformersβ64Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ94Updated 10 months ago
- A library for real-time voice processing in web browsersβ211Updated last month
- gentle forced alignerβ11Updated 10 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.β123Updated 4 months ago
- kokoro text to speech using javascriptβ54Updated last month
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 4 years ago
- ez audio transcription tool with flexible processing and post-processing optionsβ146Updated last year
- A curated list of awesome voice activity detectionβ40Updated 3 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.β66Updated last year
- speech-recorder is a node.js module for streaming audio from a device's microphone and filtering for speech.β90Updated last year
- Joint speech-language model - respond directly to audio!β30Updated 10 months ago
- Experiments to test different speech recognition systems for SEPIA Frameworkβ58Updated last year