yohasebe / whisper-stream
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
β100Updated 8 months ago
Alternatives and similar repositories for whisper-stream:
Users that are interested in whisper-stream are comparing it to the libraries listed below
- An API to transcribe audio with OpenAI's Whisper Large v3!β232Updated 2 months ago
- π¬ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.β203Updated 2 months ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS appβ195Updated 7 months ago
- web based editor for subtitles and transcriptsβ118Updated 5 months ago
- Live-Transcription (STT) with Whisper PoCβ167Updated 7 months ago
- Whisper realtime streaming for long speech-to-text transcription and translationβ110Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ91Updated 8 months ago
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.β189Updated 3 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ333Updated 7 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannoteβ189Updated 4 months ago
- A simple client and utils for interacting with OpenAI's Realtime API in Pythonβ208Updated 2 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.β83Updated 2 weeks ago
- Demo of AI chatbot that predicts user message to generate response quickly.β100Updated 11 months ago
- streaming speech to text server using Whisperβ85Updated last year
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JSβ786Updated 3 months ago
- A Function Calls Proxy for Groq, the fastest AI alive!β183Updated 10 months ago
- Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio traβ¦β51Updated last year
- Example of calling OpenRouter from a Streamit appβ94Updated last year
- ez audio transcription tool with flexible processing and post-processing optionsβ141Updated 11 months ago
- Transcription with speaker diarization pipelineβ89Updated last year
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.β64Updated last year
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.β113Updated 8 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agentsΒβ194Updated 3 months ago
- Deepgram Conversational AI demoβ361Updated last week
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β134Updated this week
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.β120Updated 7 months ago
- β167Updated 5 months ago
- Visual node-edge graph GUI editor, with electron and docker wrapper for frontend,backend,localLLMβ115Updated this week
- β348Updated 9 months ago
- Open source conversation framework and visual editor for structured Pipecat dialoguesβ115Updated last week