themanyone / whisper_dictationLinks
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
☆239Updated last week
Alternatives and similar repositories for whisper_dictation
Users that are interested in whisper_dictation are comparing it to the libraries listed below
Sorting:
- State-of-the-art offline voice typing everywhere + txt terminals (Linux or WFL sesson on Windows.) with a simple bash script. Usable with…☆115Updated 3 weeks ago
- This is a python script using whisper to type with your voice☆58Updated 3 weeks ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆642Updated 9 months ago
- 💬📝 A small dictation app using OpenAI's Whisper speech recognition model.☆821Updated 9 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆419Updated 9 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆122Updated 3 weeks ago
- Short code for dictation using OpenAI Whisper for transcription.☆91Updated 2 months ago
- ☆95Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆213Updated 7 months ago
- Command Your World with Voice☆687Updated 5 months ago
- IRIS: Demonstrator for use of LLMs in python (outdated)☆62Updated 2 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆217Updated last month
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆352Updated 11 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆234Updated 3 months ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆205Updated 11 months ago
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.☆57Updated 4 months ago
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆46Updated 4 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- Clipboard Conqueror is a novel copy and paste copilot alternative designed to bring your very own LLM AI assistant to any text field.☆401Updated 4 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 7 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- API server for Instant voice cloning by MyShell.☆93Updated 8 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆94Updated 3 weeks ago
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 …☆158Updated 9 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆277Updated 6 months ago
- XTTSv2 Extension for oobabooga text-generation-webui☆152Updated last year
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆186Updated 3 weeks ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆109Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆150Updated last year
- Add AI capabilities to your file system using Ollama, Groq, OpenAi and other's api☆193Updated 4 months ago