Live-Transcription (STT) with Whisper PoC
☆200Jun 18, 2024Updated last year
Alternatives and similar repositories for whisper-live-transcription
Users that are interested in whisper-live-transcription are comparing it to the libraries listed below.
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆954Oct 2, 2024Updated last year
- Real time transcription with OpenAI Whisper.☆2,921Apr 15, 2025Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,599Nov 12, 2025Updated 5 months ago
- Live transcription with OpenAI Whisper☆50Nov 11, 2022Updated 3 years ago
- Transcribe is a real time transcription, conversation, Language learning platform. It provides live transcripts from microphone and speak…☆254Mar 14, 2026Updated last month
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆187Jun 8, 2023Updated 2 years ago
- Record audio or transcribe files using ctranslate2 and whisper!☆188Apr 8, 2026Updated last week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆396Jun 8, 2024Updated last year
- A Python package to build AI-powered real-time audio applications☆1,966Feb 12, 2025Updated last year
- Real-time transcription using faster-whisper☆614Jul 23, 2024Updated last year
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆831Sep 12, 2025Updated 7 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆122Jan 29, 2024Updated 2 years ago
- Getting VibeVoice 7B working with 10 GB of VRAM.☆15Aug 31, 2025Updated 7 months ago
- Faster Whisper transcription with CTranslate2☆22,222Nov 19, 2025Updated 5 months ago
- ☆12,443Oct 25, 2025Updated 5 months ago
- RAG Tool using Haystack, Mistral, and Chainlit. All open source stack on CPU.☆23Oct 14, 2023Updated 2 years ago
- Test your local LLMs on the AIME problems☆34Jun 7, 2025Updated 10 months ago
- A VoiceAssistant with WhisperAI speech recognition☆32Nov 21, 2024Updated last year
- This repository contains a simple vocoder that works with live input. The vocoder uses LPC coefficients to do voice transformations and/o…☆14Aug 19, 2022Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆101May 7, 2024Updated last year
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆359Jul 20, 2025Updated 8 months ago
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆17Aug 24, 2023Updated 2 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆940Jun 3, 2025Updated 10 months ago
- A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.☆22Updated this week
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Feb 5, 2024Updated 2 years ago
- An Open Source text-to-speech system built by inverting Whisper.☆4,590Dec 14, 2025Updated 4 months ago
- Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference,…☆14May 7, 2024Updated last year
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…☆522Mar 16, 2026Updated last month
- Streaming speech-to-text server using Whisper☆102Jun 2, 2023Updated 2 years ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming☆316Jun 17, 2025Updated 10 months ago
- A starting point for developing your own plug-in for Auto-GPT☆22May 9, 2023Updated 2 years ago
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆63Aug 13, 2024Updated last year
- ☆3,166Apr 9, 2026Updated last week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆4,068Jan 8, 2025Updated last year
- Simple LLM interface based on terminal.☆12Jan 4, 2024Updated 2 years ago
- Original, unedited source-code for budding modders out there.☆11Jan 2, 2018Updated 8 years ago
- [DEPRECATED] Central repo for fraud and risk management development and specifications☆15May 20, 2025Updated 10 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- VSCode extension for working with Architecture As A Code in the C4 model. Includes syntax highlighting, diagram preview, and tools for wo…☆36Apr 7, 2026Updated last week