savbell / whisper-writer
π¬π A small dictation app using OpenAI's Whisper speech recognition model.
β793Updated 8 months ago
Alternatives and similar repositories for whisper-writer:
Users that are interested in whisper-writer are comparing it to the libraries listed below
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.β224Updated this week
- Short code for dictation using OpenAI Whisper for transcription.β86Updated last month
- β616Updated last month
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ350Updated 10 months ago
- A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech modeβ¦β973Updated 2 weeks ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.β324Updated 2 weeks ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engineβ403Updated 8 months ago
- A simple GUI to use Whisper.β153Updated 2 weeks ago
- Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!β2,217Updated 3 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.β755Updated 3 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.β113Updated this week
- Command Your World with Voiceβ659Updated 4 months ago
- Run Orpheus 3B Locally With LM Studioβ392Updated last month
- State-of-the-art offline voice typing everywhere + txt terminals (Linux or WFL sesson on Windows.) with a simple bash script. Usable withβ¦β104Updated 2 weeks ago
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away.β736Updated 2 months ago
- A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. β¦β571Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Cβ¦β628Updated 8 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detectionβ107Updated last year
- Clipboard Conqueror is a novel copy and paste copilot alternative designed to bring your very own LLM AI assistant to any text field.β399Updated 3 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatibleβ311Updated 2 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β211Updated 3 weeks ago
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includβ¦β392Updated this week
- Generate imagined websites on an infinite canvasβ600Updated 10 months ago
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning modβ¦β503Updated this week
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and β¦β273Updated 3 weeks ago
- Mac compatible Ollama Voiceβ479Updated last year
- Real time speech to text transcription app.β408Updated 2 years ago
- Multi-backend whisper app. Blazing fast. Mac-arm optimized. Easy install. Input a local file or url and this service will transcribe it uβ¦β713Updated last week
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with a more realistic Kokoro TTS voice and vision.β56Updated 3 months ago
- Video transcript summarization from multiple sources (YouTube, Dropbox, Google Drive, local files) using multiple LLM endpoints (OpenAI, β¦β117Updated last month