Project that allows one to use a microphone with OpenAI whisper.
☆787Jul 4, 2024Updated last year
Alternatives and similar repositories for whisper_mic
Users that are interested in whisper_mic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project☆1,039Aug 29, 2023Updated 2 years ago
- ☆1,891Aug 3, 2025Updated 8 months ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆359Jul 20, 2025Updated 8 months ago
- Real time transcription with OpenAI Whisper.☆2,919Apr 15, 2025Updated last year
- Streaming transcriber with whisper☆693May 1, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆831Sep 12, 2025Updated 7 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Aug 16, 2023Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆97,479Mar 27, 2026Updated 2 weeks ago
- AI Livestreamer for Youtube☆67Mar 19, 2023Updated 3 years ago
- A nearly-live implementation of OpenAI's Whisper.☆3,962Mar 17, 2026Updated 3 weeks ago
- Shared Voice Interface☆43Oct 21, 2023Updated 2 years ago
- Faster Whisper transcription with CTranslate2☆22,041Nov 19, 2025Updated 4 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,599Nov 12, 2025Updated 5 months ago
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…☆522Mar 16, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Real time speech to text transcription app.☆435Jan 14, 2023Updated 3 years ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆184Sep 22, 2022Updated 3 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆29Dec 16, 2023Updated 2 years ago
- Port of OpenAI's Whisper model in C/C++☆48,661Mar 29, 2026Updated 2 weeks ago
- Speaker prediction for captions on the Lex Fridman podcast☆27Feb 14, 2024Updated 2 years ago
- Real-time transcription using faster-whisper☆614Jul 23, 2024Updated last year
- AI Livestreamer for Youtube☆489Mar 18, 2023Updated 3 years ago
- OpenAI Whisper ASR Webservice API☆3,233Nov 23, 2025Updated 4 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆21,210Apr 4, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,688Apr 3, 2024Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Feb 4, 2023Updated 3 years ago
- ☆11Sep 5, 2025Updated 7 months ago
- ☆265Mar 19, 2023Updated 3 years ago
- ☆27Nov 3, 2025Updated 5 months ago
- openai/whisper + extra features☆90Oct 26, 2022Updated 3 years ago
- A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.☆22Updated this week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Jun 30, 2023Updated 2 years ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆544Nov 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆4,068Jan 8, 2025Updated last year
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆2,794Sep 9, 2025Updated 7 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,734Updated this week
- A multi-voice TTS system trained with an emphasis on quality☆14,832Nov 19, 2024Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 5 years ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆45,043Aug 16, 2024Updated last year