AIWintermuteAI / WhisperLive
A nearly-live implementation of OpenAI's Whisper.
☆22Updated 3 months ago
Alternatives and similar repositories for WhisperLive:
Users that are interested in WhisperLive are comparing it to the libraries listed below
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆153Updated last year
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆104Updated 11 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆71Updated 2 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆102Updated last month
- streaming speech to text server using Whisper☆89Updated last year
- ☆43Updated 4 months ago
- Pybind11 bindings for Whisper.cpp☆54Updated last month
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆37Updated 2 months ago
- Real-time Speech To Text using Faster Whisper.☆53Updated 7 months ago
- Automatic Speech Recognition Assistant☆31Updated 10 months ago
- Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce…☆91Updated 10 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆192Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆156Updated 8 months ago
- Chat with your pdf using your local LLM, OLLAMA client.(incomplete)☆36Updated 5 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆51Updated 7 months ago
- Whisperx API implementation☆25Updated 10 months ago
- ☆45Updated 6 months ago
- Experimental Python SDK for OpenAI's Realtime API☆43Updated last month
- Sesame CSM 1B Voice Cloning☆246Updated 2 weeks ago
- ☆43Updated 3 months ago
- On-device streaming text-to-speech engine powered by deep learning☆73Updated this week
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆143Updated last week
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆43Updated 2 months ago
- ☆1,622Updated this week
- ☆91Updated 2 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆53Updated 4 months ago
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆33Updated 7 months ago
- FastAPI service on top of WhisperX☆78Updated this week
- Open source repo for AI in a Box.☆63Updated 11 months ago
- Daily Client SDK for Python☆54Updated this week