linto-ai / WebVoiceSDK
Building blocks for voice-enabled applications in the browser
☆37 Updated 3 weeks ago
Alternatives and similar repositories for WebVoiceSDK:
Users who are interested in WebVoiceSDK are comparing it to the libraries listed below.
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ☆47 Updated last year
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc… ☆29 Updated 9 months ago
- Web Browser Audio Detection/Speech Recording Events API ☆74 Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆112 Updated last year
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice ☆10 Updated 4 years ago
- ☆16 Updated last year
- An even smaller speech recognizer / force aligner ☆32 Updated 4 months ago
- A library for real-time voice processing in web browsers ☆215 Updated 3 months ago
- Smart Whisper is a native Node.js addon designed for efficient and streamlined interaction with the whisper.cpp, with automatic model off… ☆53 Updated last week
- Web app for keyword spotting using TensorflowJS ☆71 Updated 2 years ago
- ☆36 Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆213 Updated this week
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server. ☆28 Updated 3 years ago
- GUI Tool to create, manage and test Keyword Spotting models using TF 2.0 ☆12 Updated 4 years ago
- 🐍 Coqui's machine learning job scheduler ☆32 Updated 3 years ago
- Voice activation detection library for NodeJS ☆55 Updated 5 years ago
- 🐸TTS recipes for different datasets ☆87 Updated 2 years ago
- streaming speech to text server using Whisper ☆92 Updated last year
- ☆26 Updated 3 years ago
- Speaker Diarization with Transformers ☆64 Updated 11 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆114 Updated last year
- Speaker diarization service ☆22 Updated 3 weeks ago
- a simple system for 2-way interruptible voice interactions between human and LLM ☆28 Updated last year
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package ☆21 Updated last week
- Accelerate Whisper tasks such as transcription, by multiprocessing through parallelization ☆25 Updated 2 years ago
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server. ☆47 Updated 3 years ago
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate ☆24 Updated last year
- Voice data <= 10 mins can also be used to train a good VC model! ☆12 Updated last year
- A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input. ☆16 Updated this week