ricky0123 / vadLinks
Voice activity detector (VAD) for the browser with a simple API
☆1,650Updated this week
Alternatives and similar repositories for vad
Users that are interested in vad are comparing it to the libraries listed below
Sorting:
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,388Updated last month
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆923Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆7,120Updated last week
- Local SRT/LLM/TTS Voicechat☆732Updated last year
- Converts text to speech in realtime☆3,589Updated 3 months ago
- A nearly-live implementation of OpenAI's Whisper.☆3,488Updated last month
- ☆982Updated last month
- A python package to build AI-powered real-time audio applications☆1,484Updated 8 months ago
- React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in☆782Updated last year
- ☆2,527Updated this week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆847Updated 4 months ago
- Node.js + JavaScript reference client for the Realtime API (beta)☆1,000Updated 11 months ago
- Interface for OuteTTS models.☆1,390Updated 4 months ago
- Build realtime multimodal AI agents with Node.js☆600Updated this week
- Example UI implementing the RTVI web client☆477Updated 10 months ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆2,634Updated last month
- ML-powered speech recognition directly in your browser☆3,117Updated last year
- first base model for full-duplex conversational audio☆1,766Updated 9 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,632Updated last year
- Real-Time Voice Inference Web SDK☆287Updated this week
- An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.☆1,469Updated last week
- Command Your World with Voice☆765Updated 4 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆2,938Updated last week
- Whisper with Medusa heads☆862Updated 2 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,050Updated last week
- Real time transcription with OpenAI Whisper.☆2,872Updated 6 months ago
- React app for inspecting, building and debugging with the Realtime API☆3,483Updated last month
- Local realtime voice AI☆2,372Updated 7 months ago
- A fast multimodal LLM for real-time voice☆4,226Updated last month
- Open Source framework for voice and multimodal conversational AI☆8,484Updated this week