DictationDaddy / VAD_WEB_DEMOLinks
In this repository, I show you how to use SILERO VAD with ONNX-WEB runtime to run the VAD compeletely in the browser.
☆24Updated 8 months ago
Alternatives and similar repositories for VAD_WEB_DEMO
Users that are interested in VAD_WEB_DEMO are comparing it to the libraries listed below
Sorting:
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 9 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆136Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆223Updated 7 months ago
- Thin wrapper around OpenAI Whisper API with streaming support☆89Updated 8 months ago
- CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search☆65Updated last month
- streaming speech to text server using Whisper☆94Updated 2 years ago
- React / Vanilla JS Text to Speech with highlighting the words and sentences that are being spoken using audio files, text to speech API, …☆170Updated last week
- ☆27Updated 2 years ago
- Real-Time Voice Inference Web SDK☆287Updated this week
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆98Updated last month
- ☆293Updated last week
- faster-whisper as serverless endpoint☆118Updated 4 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆302Updated 10 months ago
- Real-time voice agent powered by Agora and OpenAI☆94Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- A simple voice assistant example built with Next.js and LiveKit React Components☆295Updated this week
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆74Updated this week
- Record and stream WAV audio data in the browser across all platforms☆88Updated 10 months ago
- Open source inference code for Rev's model☆429Updated 5 months ago
- LiveKit real-time and server SDKs for Python☆266Updated this week
- Run OpenAI Whisper as a Cog model☆64Updated 10 months ago
- Play with OpenAI's new Realtime API in your browser☆335Updated last week
- ☆44Updated this week
- Efficient approach to speaker diarization using voice characteristics extraction☆100Updated 3 months ago
- Have a natural voice conversation with an LLM☆256Updated 9 months ago
- Create Animated Subtitles From .SRT files in Remotion☆68Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆218Updated 10 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆246Updated 2 weeks ago
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆71Updated last year