AndreMarkert / whisper-webui
A browser interface based on the Gradio library for OpenAI's Whisper model.
☆37 Updated last year
Alternatives and similar repositories for whisper-webui:
Users who are interested in whisper-webui are comparing it to the repositories listed below.
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆89 Updated this week
- Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differ… ☆19 Updated last year
- ez audio transcription tool with flexible processing and post-processing options ☆144 Updated last year
- web based editor for subtitles and transcripts ☆121 Updated 6 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆92 Updated 9 months ago
- A QT GUI for large language models ☆30 Updated last year
- ☆80 Updated 7 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆48 Updated 6 months ago
- Site for sharing Bark voices ☆48 Updated 7 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient… ☆47 Updated last week
- ☆65 Updated 4 months ago
- FastAPI service on top of WhisperX ☆68 Updated 3 weeks ago
- "Just hoof it!" - A spotlight like interface to Ollama ☆57 Updated 10 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆60 Updated last year
- A simple extension that uses Bark Text-to-Speech for audio output ☆35 Updated last year
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui. ☆42 Updated last year
- ☆69 Updated 11 months ago
- ☆48 Updated last year
- A modern GUI application that transcribes and translates audio and video files, offering the option to save the subtitles as separate fil… ☆14 Updated last year
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc… ☆16 Updated last year
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible ☆13 Updated last year
- ☆94 Updated 9 months ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos ☆45 Updated 7 months ago
- A GUI interface for Open AI Whisper based on Tauri and Sveltekit ☆121 Updated 3 months ago
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast! ☆35 Updated this week
- streaming speech to text server using Whisper ☆86 Updated last year
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI ☆66 Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) ☆65 Updated last year
- Offline srt producer gui with whisper.cpp ☆25 Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆50 Updated 2 months ago