DictationDaddy / VAD_WEB_DEMO
In this repository, I show you how to use Silero VAD with ONNX Runtime Web to run the VAD completely in the browser.
☆13 Updated last month
Related projects
Alternatives and complementary repositories for VAD_WEB_DEMO
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆63 Updated 11 months ago
- A JS web client for connecting to Pipecat bots with voice and vision ☆38 Updated 4 months ago
- Demo example of consumer goods categorization ☆25 Updated last year
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp ☆45 Updated last year
- CLIP as a service - Embed images and sentences, object recognition, visual reasoning, image classification and reverse image search ☆53 Updated 10 months ago
- Run OpenAI Whisper as a Cog model ☆60 Updated 2 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆103 Updated 9 months ago
- Browser-based Voice Assistant ☆44 Updated last year
- A simple system for 2-way interruptible voice interactions between human and LLM ☆17 Updated 9 months ago
- GGML implementation of BERT model with Python bindings and quantization. ☆51 Updated 9 months ago
- Building blocks for voice-enabled applications in the browser ☆33 Updated last week
- Accelerate Whisper tasks such as transcription by multiprocessing through parallelization ☆25 Updated 2 years ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆18 Updated last month
- Experiments w/ ChatGPT, LangChain, local LLMs ☆24 Updated last year
- Sentence Embedding as a Service ☆14 Updated last year
- Generate visual podcasts about novels using open source models ☆23 Updated last year
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate ☆24 Updated last year
- Demo Python script app to interact with llama.cpp server using Whisper API, microphone and webcam devices. ☆45 Updated last year
- Thin wrapper around OpenAI Whisper API with streaming support ☆87 Updated last month
- ☆17 Updated 10 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 Updated last year
- Collection of ChatGPT plugins ☆103 Updated last year
- Daily Bots Web Demo showcasing how to build real-time voice AI agents ☆152 Updated 3 weeks ago
- Create Animated Subtitles From .SRT files in Remotion ☆30 Updated 7 months ago
- ☆12 Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆22 Updated 3 months ago
- AI Assistant that can get stock prices ☆46 Updated 11 months ago
- [WIP] AI Try-On plugin for Chrome ☆25 Updated 8 months ago
- VideoDB Python SDK ☆60 Updated 2 weeks ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated 9 months ago