Live-Transcription (STT) with Whisper PoC
☆201Jun 18, 2024Updated 2 years ago
Alternatives and similar repositories for whisper-live-transcription
Users that are interested in whisper-live-transcription are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆958Oct 2, 2024Updated last year
- A nearly-live implementation of OpenAI's Whisper.☆4,092Updated this week
- Real time transcription with OpenAI Whisper.☆2,940Apr 15, 2025Updated last year
- Live transcription with OpenAi Whisper☆50Nov 11, 2022Updated 3 years ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆195Jun 8, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- network pinger with UI☆14Feb 12, 2024Updated 2 years ago
- real-time transcription application☆12Jun 9, 2023Updated 3 years ago
- Record audio or transcribe files using ctranslate2 and whisper!☆200Jun 12, 2026Updated last week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆406Jun 8, 2024Updated 2 years ago
- A high-performance speech recognition MCP server based on Faster Whisper, providing efficient audio transcription capabilities.☆18Mar 22, 2025Updated last year
- A python package to build AI-powered real-time audio applications☆1,987Feb 12, 2025Updated last year
- Real-time transcription using faster-whisper☆615Jul 23, 2024Updated last year
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆833Sep 12, 2025Updated 9 months ago
- playlist4whisper manages media streams playlists for livestream_video.sh, plays media, and transcribes audio via AI with configurable tim…☆17May 31, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- web based editor for subtitles and transcripts☆147Aug 16, 2024Updated last year
- Getting VibeVoice 7b working with 10 gb of vram.☆15Aug 31, 2025Updated 9 months ago
- Faster Whisper transcription with CTranslate2☆23,584Nov 19, 2025Updated 7 months ago
- Auto-Video maker handling many AI's☆11Mar 18, 2024Updated 2 years ago
- ☆12,966Oct 25, 2025Updated 7 months ago
- RAG Tool using Haystack, Mistral, and Chainlit. All open source stack on CPU.☆22Oct 14, 2023Updated 2 years ago
- ☆33May 22, 2024Updated 2 years ago
- A voice chat app☆1,208May 28, 2026Updated 3 weeks ago
- A VoiceAsistant with WhisperAI speech recognition☆33Nov 21, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Test your local LLMs on the AIME problems☆39Jun 7, 2025Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆102May 7, 2024Updated 2 years ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆361Jul 20, 2025Updated 10 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆957Jun 3, 2025Updated last year
- A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.☆22Jun 6, 2026Updated last week
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Feb 5, 2024Updated 2 years ago
- An Open Source text-to-speech system built by inverting Whisper.☆4,613Dec 14, 2025Updated 6 months ago
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…☆534Mar 16, 2026Updated 3 months ago
- How to Build an AI Children’s Book Service☆31Nov 2, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- streaming speech to text server using Whisper☆102Jun 2, 2023Updated 3 years ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆320Jun 17, 2025Updated last year
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated last year
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆65Aug 13, 2024Updated last year
- ☆3,393Updated this week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆4,084Jan 8, 2025Updated last year
- Simple LLM interface based on terminal.☆12Jan 4, 2024Updated 2 years ago