DictationDaddy / VAD_WEB_DEMOLinks

In this repository, I show you how to use SILERO VAD with ONNX-WEB runtime to run the VAD compeletely in the browser.

☆22

Alternatives and similar repositories for VAD_WEB_DEMO

Users that are interested in VAD_WEB_DEMO are comparing it to the libraries listed below

Sorting:

pipecat-ai / web-client-ui
An JS web client for connecting to Pipecat bots with voice and vision
☆44Updated 5 months ago
luweigen / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆118Updated last year
daily-demos / daily-bots-web-demo
Daily Bots Web Demo showcasing how to build real-time voice AI agents
☆243Updated 7 months ago
latishab / turnsense
A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.
☆21Updated 2 months ago
ai-bot-pro / achatbot
An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.
☆53Updated this week
tarzain / crosstalk
a simple system for 2-way interruptible voice interactions between human and LLM
☆29Updated last year
pipecat-ai / pipecat-client-web
Real-Time Voice Inference Web SDK
☆241Updated this week
tincans-ai / gazelle-inference
proof of concept conversation orchestrator with a speech-language model
☆20Updated 7 months ago
pipecat-ai / pipecat-client-web-transports
A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package
☆23Updated last week
plaggy / fast-whisper-server
ASR + diarization model server with speculative decoding
☆60Updated last year
daily-demos / llm-talk
Talk to GPT-4 and create a story together.
☆90Updated last year
atyenoria / livekit-whisper-transcribe
☆26Updated 2 years ago
livekit-examples / realtime-playground
Play with OpenAI's new Realtime API in your browser
☆327Updated 5 months ago
livekit-examples / cartesia-voice-agent
An example Voice Pipeline Agent with Cartesia
☆22Updated 2 months ago
livekit / sip
SIP to WebRTC bridge for LiveKit
☆221Updated this week
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66Updated last year
pyannote / pyannote-pipeline
Tunable pipelines
☆34Updated 3 months ago
mesolitica / vllm-whisper
A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper
☆27Updated 10 months ago
livekit-examples / agent-demos
Small demos demonstrating different capabilities of LiveKit Agents
☆14Updated 2 months ago
DongKeon / webrtc-whisper-asr
WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.
☆12Updated 8 months ago
livekit-examples / voice-assistant-frontend
A simple voice assistant example built with Next.js and LiveKit React Components
☆195Updated this week
lalanikarim / webrtc-ai-voice-chat
A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.
☆132Updated 11 months ago
abb128 / turndetection
☆15Updated 3 months ago
eustlb / speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
☆62Updated 7 months ago
AgoraIO / openai-realtime-python
Real-time voice agent powered by Agora and OpenAI
☆82Updated 2 months ago
livekit-examples / voice-pipeline-agent-python
A basic voice agent built with Python agents framework
☆47Updated last month
xinliu9451 / awesome-denoiser
This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …
☆37Updated 6 months ago
Alireza29675 / whisper-live
TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.
☆70Updated last year
bentoml / CLIP-API-service
CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search
☆62Updated last year
gianpaj / call-me-please
A mobile application that lets users schedule AI-powered voice calls 📞🤖 - React Native + LiveKit + OpenAI Realtime API
☆14Updated last month