themanyone / whisper_dictation
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
☆216Updated this week
Alternatives and similar repositories for whisper_dictation:
Users that are interested in whisper_dictation are comparing it to the libraries listed below
- State-of-the-art offline voice typing everywhere + txt terminals (Linux or WFL sesson on Windows.) with a simple bash script. Usable with…☆93Updated last week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆343Updated 10 months ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆202Updated 10 months ago
- ez audio transcription tool with flexible processing and post-processing options☆148Updated last year
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆105Updated 11 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆105Updated last month
- Real-Time Whisper Voice Recognition with vosk model feedback.☆111Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆606Updated 7 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆207Updated 5 months ago
- This is a python script using whisper to type with your voice☆55Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆384Updated 7 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆260Updated 4 months ago
- web based editor for subtitles and transcripts☆128Updated 7 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆52Updated 7 months ago
- API server for Instant voice cloning by MyShell.☆88Updated 6 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆195Updated last month
- IRIS: Demonstrator for use of LLMs in python (outdated)☆62Updated 2 weeks ago
- Offline voice input panel & keyboard with punctuation for Android.☆102Updated 10 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆202Updated last month
- Short code for dictation using OpenAI Whisper for transcription.☆79Updated 3 weeks ago
- Handy voice dictation using whisper.☆14Updated 7 months ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆346Updated last year
- Command Your World with Voice☆634Updated 4 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 4 months ago
- 🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses XTTS, OpenAI, ElevenLabs or Kokoro☆194Updated this week
- XTTSv2 Extension for oobabooga text-generation-webui☆152Updated last year
- Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp offline. Speak with local LLMs.☆67Updated 4 months ago
- Use local llama LLM or openai to chat, discuss/summarize your documents, youtube videos, and so on.☆152Updated 3 months ago
- A curated list of awesome OpenAI's Whisper☆100Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago