linto-ai / WebVoiceSDKLinks
Buildings block for voice-enabled applications in the browser
☆37Updated last week
Alternatives and similar repositories for WebVoiceSDK
Users that are interested in WebVoiceSDK are comparing it to the libraries listed below
Sorting:
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆46Updated 2 years ago
- Web Browser Audio Detection/Speech Recording Events API☆78Updated 3 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆31Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- A library for real-time voice processing in web browsers☆238Updated 3 weeks ago
- ☆26Updated 3 years ago
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆25Updated 2 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 5 years ago
- 🐍 Coqui's machine learning job scheduler☆31Updated 4 years ago
- An even smaller speech recognizer / force aligner☆37Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆51Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆242Updated 3 weeks ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆31Updated last week
- streaming speech to text server using Whisper☆101Updated 2 years ago
- Web app for keyword spotting using TensorflowJS☆74Updated 3 years ago
- ☆108Updated 10 months ago
- web based editor for subtitles and transcripts☆143Updated last year
- ☆16Updated 2 years ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆45Updated 3 months ago
- Simple text to phones converter using eSpeak NG.☆42Updated last year
- ☆38Updated last year
- Transcription with speaker diarization pipeline☆98Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆135Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- GUI Tool to create, manage and test Keyword Spotting models using TF 2.0☆13Updated 5 years ago
- A realtime drawing game showcasing the use of LiveKit data capabilities in an Agents-based app.☆44Updated this week
- On-device Speech-to-Index engine powered by deep learning☆36Updated 9 months ago
- 📈 A forced aligner intended for synchronization of narrated text☆102Updated 6 months ago
- ☆18Updated 4 years ago