alesaccoia / VoiceStreamAI
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
☆650Updated 2 months ago
Related projects: ⓘ
- Whisper realtime streaming for long speech-to-text transcription and translation☆1,770Updated 2 weeks ago
- ☆384Updated this week
- A nearly-live implementation of OpenAI's Whisper.☆1,798Updated 2 weeks ago
- Build real-time multimodal AI applications 🤖🎙️📹☆1,053Updated this week
- A python package to build AI-powered real-time audio applications☆992Updated 2 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,509Updated last month
- ☆1,079Updated 2 months ago
- Voice activity detector (VAD) for the browser with a simple API☆773Updated last month
- Converts text to speech in realtime☆1,730Updated 3 weeks ago
- Local SRT/LLM/TTS Voicechat☆471Updated last month
- Live-Transcription (STT) with Whisper PoC☆140Updated 3 months ago
- Whisper with Medusa heads☆774Updated last week
- Example UI implementing the RTVI web client☆468Updated last month
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆475Updated last month
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆3,315Updated 2 weeks ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆1,865Updated last month
- ☆419Updated this week
- ☆486Updated 4 months ago
- Real time transcription with OpenAI Whisper.☆2,260Updated 3 months ago
- Real time speech to text transcription app.☆379Updated last year
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆308Updated 3 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆276Updated 3 weeks ago
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…☆1,641Updated last week
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆262Updated 7 months ago
- A fast multimodal LLM for real-time voice☆847Updated this week
- Command Your World with Voice☆368Updated 3 weeks ago
- A talking LLM that runs on your own computer without needing the internet.☆222Updated last month
- Deepgram Conversational AI demo☆324Updated 2 weeks ago
- Pybind11 bindings for Whisper.cpp☆321Updated this week
- ☆431Updated 2 months ago