dariox1337 / whisper-writerLinks
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
☆11Updated last year
Alternatives and similar repositories for whisper-writer
Users that are interested in whisper-writer are comparing it to the libraries listed below
Sorting:
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated last year
 - Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆155Updated last month
 - Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆28Updated last year
 - Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash☆40Updated last month
 - A curated list of awesome OpenAI's Whisper☆98Updated 2 years ago
 - Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆269Updated last month
 - An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆61Updated 6 months ago
 - a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
 - Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)☆50Updated last month
 - An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆88Updated 9 months ago
 - Simulates talk with an AI that can express emotions☆81Updated 4 months ago
 - streaming speech to text server using Whisper☆95Updated 2 years ago
 - WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
 - Faster Whisper with additional features☆48Updated 7 months ago
 - IRIS: Demonstrator for use of LLMs in python (outdated)☆63Updated 7 months ago
 - Streaming and Fine-tuning for Chatterbox TTS☆204Updated 4 months ago
 - Sophia AI Assistant is a Python-based desktop AI that performs a variety of tasks, including answering questions, opening applications, b…☆22Updated last year
 - API server for Instant voice cloning by MyShell.☆104Updated last year
 - Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation …☆20Updated last month
 - A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
 - 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆95Updated 4 months ago
 - Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files☆63Updated last month
 - A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆15Updated 5 months ago
 - Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.☆29Updated 2 months ago
 - OpenAI compatible API for Dia-1.6B☆35Updated 6 months ago
 - Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated last year
 - Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
 - A QT GUI for large language models☆39Updated last year
 - This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆18Updated last year
 - Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI AP…☆358Updated 2 weeks ago