AndreMarkert / whisper-webuiLinks
A browser interface based on the Gradio library for OpenAI's Whisper model.
☆43Updated 2 years ago
Alternatives and similar repositories for whisper-webui
Users that are interested in whisper-webui are comparing it to the libraries listed below
Sorting:
- ez audio transcription tool with flexible processing and post-processing options☆159Updated last year
- XTTSv2 Extension for oobabooga text-generation-webui☆154Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆154Updated last month
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆267Updated last month
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- Oobabooga extension for Bark TTS☆118Updated last year
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆87Updated 8 months ago
- web based editor for subtitles and transcripts☆141Updated last year
- Site for sharing Bark voices☆51Updated 7 months ago
- ☆71Updated 2 months ago
- A curated list of awesome OpenAI's Whisper☆98Updated 2 years ago
- Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differ…☆18Updated last year
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆42Updated last year
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 …☆161Updated last year
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆46Updated last year
- An API for VoiceCraft.☆25Updated last year
- ☆91Updated 5 months ago
- Local & Private LLM that drafts responses LIKE you automatically☆82Updated 11 months ago
- A multi-voice TTS system trained with an emphasis on quality☆26Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆73Updated 2 years ago
- Self-hosted AI medical scribe.☆52Updated this week
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆73Updated last year
- Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)☆290Updated this week
- A simple TTS server for generating speech using StyleTTS2☆37Updated last year
- A high-performance speech recognition MCP server based on Faster Whisper, providing efficient audio transcription capabilities.☆11Updated 7 months ago
- AI-Toolbox provides a collection of automation scripts and tools to streamline your AI workflows.☆68Updated last month
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t…☆30Updated last year
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI☆70Updated 2 years ago