dariox1337 / whisper-writerLinks
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
☆11Updated last year
Alternatives and similar repositories for whisper-writer
Users that are interested in whisper-writer are comparing it to the libraries listed below
Sorting:
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Updated last year
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆15Updated 6 months ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆38Updated 10 months ago
- streaming speech to text server using Whisper☆98Updated 2 years ago
- A curated list of awesome OpenAI's Whisper☆98Updated 2 years ago
- ☆87Updated 9 months ago
- Simulates talk with an AI that can express emotions☆82Updated 5 months ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆276Updated 2 months ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆19Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆159Updated 2 months ago
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash☆40Updated 2 weeks ago
- On-device streaming text-to-speech engine powered by deep learning☆122Updated last week
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆139Updated 8 months ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
- Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI AP…☆397Updated last month
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆61Updated 6 months ago
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆62Updated 10 months ago
- Whisper from OpenAi and diarization with Pyannote☆50Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆159Updated last year
- Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation …☆21Updated 2 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆220Updated 5 months ago
- Faster Whisper with additional features☆48Updated 8 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Updated 2 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆115Updated last year
- A local and uncensored AI entity.☆96Updated 3 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆98Updated 5 months ago
- EPUB, PDF, DOCX, MD, and TXT file text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks…☆234Updated this week