DictationDaddy / VAD_WEB_DEMO
In this repository, I show you how to use Silero VAD with ONNX Runtime Web to run the VAD completely in the browser.
☆17 Updated last month
Alternatives and similar repositories for VAD_WEB_DEMO:
Users who are interested in VAD_WEB_DEMO are comparing it to the repositories listed below.
- A JS web client for connecting to Pipecat bots with voice and vision ☆43 Updated 2 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆111 Updated last year
- ☆21 Updated last year
- An open-source chatbot architecture for voice/vision (and multimodal) assistants that run locally (CPU/GPU bound) or remotely (I/O bound). ☆25 Updated this week
- A WebRTC server that lets you interact with an LLM using speech and responds with generated audio. ☆122 Updated 8 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents ☆204 Updated 3 months ago
- Create Animated Subtitles From .SRT files in Remotion ☆41 Updated 10 months ago
- Real-Time Voice Inference Web SDK ☆192 Updated 3 weeks ago
- ☆188 Updated this week
- A function to do all ☆35 Updated 10 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V… ☆26 Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆92 Updated 9 months ago
- Talk to GPT-4 and create a story together. ☆87 Updated last year
- ☆10 Updated 2 years ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package ☆13 Updated this week
- A simple system for two-way interruptible voice interactions between a human and an LLM ☆21 Updated last year
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint. ☆84 Updated last week
- A streaming whisper server for on-prem transcription ☆19 Updated 6 months ago
- ASR + diarization model server with speculative decoding ☆55 Updated 8 months ago
- ☆12 Updated 4 months ago
- A multimodal agent built with Python agents framework ☆18 Updated this week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆39 Updated 4 months ago
- Second attempt at AI webcam, this time with OpenAI API ☆38 Updated last year
- A lightweight Python library for running TTS models with a unified API. ☆16 Updated last month
- Thin wrapper around OpenAI Whisper API with streaming support ☆89 Updated 3 weeks ago
- Website with current metrics on the fastest AI models. ☆40 Updated 3 months ago
- ☆59 Updated last year
- AskYP is an open-source AI chatbot that uses OpenAI Functions and the Vercel AI SDK to interact with the Yelp Fusion API with natural lan… ☆18 Updated last year
- Run OpenAI Whisper as a Cog model ☆61 Updated 2 months ago