Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)
☆109Aug 18, 2025Updated 7 months ago
Alternatives and similar repositories for voice-activity-detection-vad-realtime
Users that are interested in voice-activity-detection-vad-realtime are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A modular, multi-agent AI research and report generation platform. Enter any topic, and PolyAgent Research Intelligence orchestrates mult…☆11Jan 20, 2025Updated last year
- Automation Assistant for UI Task Execution.☆11Jan 3, 2025Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆248Mar 26, 2026Updated 2 weeks ago
- Speak (speech-to-text) to LLMs (Ollama) in any lanaguage - Streamlit app https://deepwiki.com/iamaziz/llm-voice-bot☆47Feb 27, 2024Updated 2 years ago
- Welcome to the Real-Time Voice Activity Detection (VAD) program, powered by Silero-VAD model! 🚀 This program allows you to perform live …☆12Jul 9, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This Repo focuses on defending against 'adversarial prompts,' detecting and attempting to mitigate objectionable content in real time.☆14Jul 30, 2023Updated 2 years ago
- ☆17Apr 1, 2026Updated last week
- Voice activity detector (VAD) for the browser with a simple API☆1,902Jan 30, 2026Updated 2 months ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Updated this week
- Screenshots in record time - up to 2.5x faster than MSS (Multiple Screen Shots)☆12May 19, 2023Updated 2 years ago
- SMLP2022: Advanced methods in frequentist statistics with ulia☆14Jul 3, 2023Updated 2 years ago
- Datasets for turn-taking research☆19Dec 21, 2023Updated 2 years ago
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,741Mar 26, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- This repository will contain projects on multi-agent applications using frameworks such as crewai, langchain, gradio, hugging face etc.☆24Aug 17, 2024Updated last year
- Minimalistic game engine with Lua scripting☆11Dec 20, 2023Updated 2 years ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆54Dec 11, 2022Updated 3 years ago
- Praat-based tools for EGG analysis☆19Sep 21, 2023Updated 2 years ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Jul 30, 2024Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- ☆16Jan 4, 2025Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Jun 6, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13May 28, 2025Updated 10 months ago
- A repo dedicated to different approaches in building a Persian Generative Chatbot.☆12Sep 7, 2022Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- PyTorch implementation of STAGE model☆17Mar 17, 2025Updated last year
- Speech recognition with ESP32 and Edge Impulse.☆10Apr 25, 2024Updated last year
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated 2 years ago
- Embeddable ringcentral phone for hubspot(Google Chrome extension)☆12Mar 4, 2023Updated 3 years ago
- ice.js 2 官网&文档☆11Nov 17, 2022Updated 3 years ago
- ☆18Jan 20, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Notebooks for SMLP2021☆27Oct 16, 2021Updated 4 years ago
- 🐸STT integration examples☆132Sep 23, 2022Updated 3 years ago
- Open Source Text Embedding Models with OpenAI Compatible API☆167Jul 13, 2024Updated last year
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- ☆14Apr 7, 2020Updated 6 years ago
- Evaluation of bm42 sparse indexing algorithm☆75Jul 10, 2024Updated last year
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆120Jan 26, 2025Updated last year