Alireza29675 / whisper-live
TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.
☆70 Updated last year
Alternatives and similar repositories for whisper-live
Users who are interested in whisper-live are comparing it to the libraries listed below.
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection ☆109 Updated last year
- A JS web client for connecting to Pipecat bots with voice and vision ☆45 Updated 6 months ago
- Kokoro text-to-speech using JavaScript ☆58 Updated 4 months ago
- Context-Aware Semantic Cache for Conversational AI ☆26 Updated 5 months ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app ☆210 Updated last year
- Real-Time Voice Inference Web SDK ☆246 Updated last week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆95 Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation ☆119 Updated last year
- Demo code from @juberti ☆21 Updated 2 weeks ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆132 Updated last year
- faster-whisper as serverless endpoint ☆105 Updated last month
- A curated list of awesome OpenAI's Whisper ☆101 Updated last year
- An API to transcribe audio with OpenAI's Whisper Large v3! ☆286 Updated 7 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API… ☆31 Updated 9 months ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs ☆43 Updated last year
- WIP exploration using Twilio Media Streams and Generative AI ☆40 Updated last year
- Demo of AI chatbot that predicts user message to generate response quickly. ☆103 Updated last year
- streaming speech to text server using Whisper ☆93 Updated 2 years ago
- ☆53 Updated 3 weeks ago
- Daily Client SDK for Python ☆59 Updated last week
- List of curated use cases built using Sesame's CSM 1B ☆66 Updated 3 weeks ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆63 Updated last week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆52 Updated 6 months ago
- ☆38 Updated last week
- Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️ ☆77 Updated last year
- Faster Whisper with additional features ☆44 Updated 3 months ago
- Simulates talk with an AI that can express emotions ☆71 Updated last week
- A sample speech transcription app implementing OpenAI Speech to Text API based on Whisper, an automatic speech recognition (ASR) system, … ☆80 Updated last year
- ☆41 Updated 9 months ago
- Chat interface that searches the web for you real-time ☆100 Updated 8 months ago