savbell / whisper-writer
π¬π A small dictation app using OpenAI's Whisper speech recognition model.
β304Updated 3 weeks ago
Related projects: β
- Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, AI images, webcam, recorder, voice control, in under 4 GiB β¦β157Updated this week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ308Updated 3 months ago
- Modern GUI application that transcribes and translate audio files using OpenAI Whisper.β108Updated last month
- β384Updated this week
- A simple GUI to use Whisper.β84Updated last month
- Dictation app based on the OpenAI speech-to-text modelsβ143Updated last month
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Cβ¦β475Updated last month
- Clipboard Conqueror is a novel copy and paste copilot alternative designed to bring your very own LLM AI assistant to any text field.β327Updated 2 months ago
- Real time speech to text transcription app.β379Updated last year
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and β¦β147Updated 3 weeks ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engineβ276Updated 3 weeks ago
- Command Your World with Voiceβ368Updated 3 weeks ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β134Updated 3 weeks ago
- How to use OpenAIs Whisper to transcribe and diarize audio filesβ278Updated last year
- β486Updated 4 months ago
- β254Updated 2 weeks ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.β322Updated 8 months ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streamingβ234Updated 3 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.β308Updated 3 weeks ago
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning modβ¦β299Updated last week
- Podalize: Podcast Transcription and Analysisβ146Updated last week
- The subtitles and translations are generated in real-time and displayed as pop-ups.β111Updated last year
- Auto transcribe tool based on whisperβ213Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ262Updated 7 months ago
- This is a python script using whisper to type with your voiceβ50Updated last year
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS appβ171Updated 3 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatibleβ229Updated last month
- Python bindings for whisper.cppβ150Updated this week
- An AI assistant beyond the chat box.β314Updated 6 months ago
- Pybind11 bindings for Whisper.cppβ321Updated this week