linto-ai / WebVoiceSDKLinks
Buildings block for voice-enabled applications in the browser
☆37Updated 8 months ago
Alternatives and similar repositories for WebVoiceSDK
Users that are interested in WebVoiceSDK are comparing it to the libraries listed below
Sorting:
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆46Updated 2 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆31Updated last year
- Web Browser Audio Detection/Speech Recording Events API☆77Updated 3 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 5 years ago
- A library for real-time voice processing in web browsers☆237Updated 2 weeks ago
- An even smaller speech recognizer / force aligner☆37Updated last year
- 🐍 Coqui's machine learning job scheduler☆31Updated 4 years ago
- On-device voice activity detection (VAD) powered by deep learning☆241Updated last week
- Web app for keyword spotting using TensorflowJS☆74Updated 3 years ago
- ☆16Updated 2 years ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆29Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆135Updated last year
- Smart Whisper is a native Node.js addon designed for efficient and streamlined interaction with the whisper.cpp, with automatic model off…☆71Updated this week
- Transcription with speaker diarization pipeline☆97Updated 2 years ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆43Updated 2 months ago
- Simple text to phones converter using eSpeak NG.☆41Updated last year
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆25Updated 2 years ago
- streaming speech to text server using Whisper☆98Updated 2 years ago
- web based editor for subtitles and transcripts☆142Updated last year
- ☆38Updated last year
- SemanticFinder - frontend-only live semantic search with transformers.js☆319Updated 9 months ago
- ☆26Updated 3 years ago
- OpenAI's CLIP model ported to JavaScript using the ONNX web runtime☆157Updated 2 years ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆51Updated last year
- Browser-compatible JS library for running language models☆232Updated 3 years ago
- A React component to make correcting automated transcriptions of audio and video easier and faster. Using the SlateJs editor.☆84Updated 3 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Speaker diarization model☆32Updated 2 years ago