themanyone / whisper_dictation
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
β177Updated last month
Related projects β
Alternatives and complementary repositories for whisper_dictation
- State-of-the-art voice typing in Linux terminal (or WFL sesson on Windows.) with a simple bash script. Works with X. Does not require X.β62Updated last month
- π¬π A small dictation app using OpenAI's Whisper speech recognition model.β358Updated 2 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ322Updated 5 months ago
- Efficient approach to speaker diarization using voice characteristics extractionβ68Updated 6 months ago
- π¬ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.β196Updated 3 weeks ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engineβ312Updated 2 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)β47Updated last month
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.β47Updated 8 months ago
- Command Your World with Voiceβ443Updated this week
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 β¦β137Updated 3 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β157Updated last month
- Whisper realtime streaming for long speech-to-text transcription and translationβ103Updated 9 months ago
- XTTSv2 Extension for oobabooga text-generation-webuiβ147Updated last year
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS appβ186Updated 5 months ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Cβ¦β518Updated 3 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.β69Updated last month
- Real-Time Whisper Voice Recognition with vosk model feedback.β105Updated last year
- web based editor for subtitles and transcriptsβ112Updated 3 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannoteβ169Updated 2 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detectionβ91Updated 6 months ago
- A curated list of awesome OpenAI's Whisperβ93Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.β43Updated 3 months ago
- Live-Transcription (STT) with Whisper PoCβ155Updated 5 months ago
- Simulates talk with an AI that can express emotionsβ30Updated 3 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.β53Updated 10 months ago
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow youβ¦β112Updated this week
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β63Updated this week
- This is a python script using whisper to type with your voiceβ52Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ84Updated 6 months ago
- API server for Instant voice cloning by MyShell.β69Updated last month