DictationDaddy / VAD_WEB_DEMO
In this repository, I show you how to use Silero VAD with ONNX Runtime Web to run the VAD completely in the browser.
☆22Updated 5 months ago
Alternatives and similar repositories for VAD_WEB_DEMO
Users who are interested in VAD_WEB_DEMO are comparing it to the libraries listed below.
- A JS web client for connecting to Pipecat bots with voice and vision☆44Updated 5 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆118Updated last year
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆243Updated 7 months ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆21Updated 2 months ago
- An open-source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).☆53Updated this week
- A simple system for two-way interruptible voice interactions between a human and an LLM☆29Updated last year
- Real-Time Voice Inference Web SDK☆241Updated this week
- proof of concept conversation orchestrator with a speech-language model☆20Updated 7 months ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆23Updated last week
- ASR + diarization model server with speculative decoding☆60Updated last year
- Talk to GPT-4 and create a story together.☆90Updated last year
- ☆26Updated 2 years ago
- Play with OpenAI's new Realtime API in your browser☆327Updated 5 months ago
- An example Voice Pipeline Agent with Cartesia☆22Updated 2 months ago
- SIP to WebRTC bridge for LiveKit☆221Updated this week
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Tunable pipelines☆34Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆27Updated 10 months ago
- Small demos demonstrating different capabilities of LiveKit Agents☆14Updated 2 months ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆12Updated 8 months ago
- A simple voice assistant example built with Next.js and LiveKit React Components☆195Updated this week
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆132Updated 11 months ago
- ☆15Updated 3 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆62Updated 7 months ago
- Real-time voice agent powered by Agora and OpenAI☆82Updated 2 months ago
- A basic voice agent built with Python agents framework☆47Updated last month
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆37Updated 6 months ago
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆70Updated last year
- CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search☆62Updated last year
- A mobile application that lets users schedule AI-powered voice calls 📞🤖 - React Native + LiveKit + OpenAI Realtime API☆14Updated last month