Alireza29675 / whisper-live
TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.
☆67Updated last year
Alternatives and similar repositories for whisper-live:
Users that are interested in whisper-live are comparing it to the libraries listed below
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆198Updated 9 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆43Updated 3 months ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆42Updated last year
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆104Updated 10 months ago
- Real-Time Voice Inference Web SDK☆204Updated this week
- This project is a straightforward demonstration that utilizes Vercel AI SDK to implement Generative UI.☆47Updated last year
- A sample speech transcription app implementing OpenAI Text to Speech API based on Whisper, an automatic speech recognition (ASR) system, …☆78Updated last year
- streaming speech to text server using Whisper☆90Updated last year
- Self-hosted AI voice agent☆94Updated 7 months ago
- Record and stream WAV audio data in the browser across all platforms☆62Updated 4 months ago
- Starter project for building real-time AI Voice Assistants☆37Updated 5 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆221Updated 4 months ago
- kokoro text to speech using javascript☆55Updated last month
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- FastAPI service on top of WhisperX☆76Updated this week
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆115Updated 10 months ago
- Local & private voice controlled notepad using whisper.cpp☆24Updated last year
- CodeWhisper: AI-Powered End-to-End Task Implementation & blazingly fast Codebase-to-LLM Context Bridge☆72Updated 3 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆75Updated 6 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆25Updated 6 months ago
- Transcription and Diarization based on OpenAI's Whisper☆21Updated last year
- Chat interface that searches the web for you real-time☆89Updated 5 months ago
- WIP exploration using Twilio Media Streams and Generative AI☆39Updated last year
- Data Questionnaire Agent Chatbot☆64Updated 2 weeks ago
- web based editor for subtitles and transcripts☆126Updated 7 months ago
- Open Sourced NoteBookLM☆58Updated 5 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago