Alireza29675 / whisper-liveLinks
TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.
☆71Updated last year
Alternatives and similar repositories for whisper-live
Users that are interested in whisper-live are comparing it to the libraries listed below
Sorting:
- Real-Time Voice Inference Web SDK☆287Updated this week
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆216Updated last year
- faster-whisper as serverless endpoint☆117Updated 4 months ago
- kokoro text to speech using javascript☆62Updated 7 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆112Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆136Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 9 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆150Updated 2 weeks ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆223Updated 7 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆302Updated 10 months ago
- streaming speech to text server using Whisper☆93Updated 2 years ago
- Talk to GPT-4 and create a story together.☆91Updated last year
- List of curated use cases built using Sesame's CSM 1B☆73Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆120Updated last year
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆245Updated last week
- ☆50Updated 11 months ago
- Automatically generate engaging AI podcasts from nothing but an episode title.☆127Updated last month
- A Function Calls Proxy for Groq, the fastest AI alive!☆204Updated last year
- ☆77Updated last month
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆215Updated 11 months ago
- The JavaScript client for the Cartesia API.☆112Updated 2 weeks ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆44Updated last year
- ☆56Updated 3 months ago
- ☆89Updated last year
- WIP exploration using Twilio Media Streams and Generative AI☆40Updated last year
- Demo of AI chatbot that predicts user message to generate response quickly.☆104Updated last year
- Record and stream WAV audio data in the browser across all platforms☆88Updated 10 months ago
- Use ChatGPT over Twilio to create an AI phone agent (works for incoming or outgoing calls).☆113Updated last year