dariox1337 / whisper-writerLinks
π¬π A small dictation app using OpenAI's Whisper speech recognition model.
β11Updated last year
Alternatives and similar repositories for whisper-writer
Users that are interested in whisper-writer are comparing it to the libraries listed below
Sorting:
- Parakeet 0.6b V2 + Pyannote diarization behind a Whisper APIβ54Updated last month
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.β278Updated 2 weeks ago
- streaming speech to text server using Whisperβ98Updated 2 years ago
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flashβ46Updated 3 weeks ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ100Updated last year
- llmon-py is a multimodal webui for Llama 3-8B.β16Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLXβ29Updated last year
- β88Updated 10 months ago
- A curated list of awesome OpenAI's Whisperβ99Updated 2 years ago
- Faster Whisper with additional featuresβ48Updated 9 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.β89Updated 10 months ago
- Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation β¦β23Updated 3 months ago
- Streaming and Fine-tuning for Chatterbox TTSβ248Updated 6 months ago
- Simulates talk with an AI that can express emotionsβ82Updated 6 months ago
- a simple system for 2-way interruptible voice interactions between human and LLMβ30Updated last year
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wiβ¦β15Updated 7 months ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.β13Updated last year
- A free & open tool for transcribing audio interviews with offline ASR supportβ25Updated 2 years ago
- API server for Instant voice cloning by MyShell.β106Updated last year
- A simple, accessible and offline real-time transcription app for Android.β13Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.β162Updated this week
- chatterbox TTS + Voice Clone using onnxβ26Updated this week
- A full-stack document management and AI chat application that enables users to upload, manage, and chat with their documents using AI. Buβ¦β15Updated 4 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detectionβ115Updated last year
- ez audio transcription tool with flexible processing and post-processing optionsβ160Updated last year
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meetβ¦β62Updated 7 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.β58Updated last year
- Whisper from OpenAi and diarization with Pyannoteβ51Updated last year
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Aiβ¦β21Updated last year
- On-device streaming text-to-speech engine powered by deep learningβ122Updated last week