jhj0517 / Whisper-WebUI
A Web UI for easy subtitle using whisper model.
☆1,867Updated this week
Alternatives and similar repositories for Whisper-WebUI:
Users that are interested in Whisper-WebUI are comparing it to the libraries listed below
- A nearly-live implementation of OpenAI's Whisper.☆2,688Updated this week
- OpenAI Whisper ASR Webservice API☆2,519Updated last month
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆998Updated 2 months ago
- ☆1,675Updated this week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,825Updated 3 months ago
- 🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️☆1,445Updated last week
- TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5,…☆2,105Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translation☆2,729Updated 3 months ago
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.☆1,925Updated this week
- Faster Whisper transcription with CTranslate2☆15,291Updated 3 weeks ago
- Multi-backend whisper app. Blazing fast. Mac-arm optimized. Easy install. Input a local file or url and this service will transcribe it u…☆692Updated last week
- a gradio webui for faster whisper☆258Updated last year
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆742Updated 2 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆667Updated 3 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆155Updated last year
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper☆1,832Updated 2 weeks ago
- Batch speech to text using OpenAI's whisper.☆290Updated 2 weeks ago
- An Open Source text-to-speech system built by inverting Whisper.☆4,193Updated this week
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆1,681Updated this week
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆606Updated 8 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆758Updated 8 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,591Updated 8 months ago
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod…☆483Updated last week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆5,630Updated 8 months ago
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆846Updated 6 months ago
- Webui for using XTTS and for finetuning it☆776Updated 2 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆344Updated 10 months ago
- ML-powered speech recognition directly in your browser☆2,883Updated 6 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆4,364Updated last month
- ☆743Updated 5 months ago