jhj0517 / Whisper-WebUI
A Web UI for easy subtitle generation using the Whisper model.
☆2,586 Updated this week
Alternatives and similar repositories for Whisper-WebUI
Users that are interested in Whisper-WebUI are comparing it to the libraries listed below
- A nearly-live implementation of OpenAI's Whisper. ☆3,692 Updated 3 months ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2. ☆1,176 Updated 2 weeks ago
- 🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️ ☆1,605 Updated 3 months ago
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python. ☆2,755 Updated last month
- ☆2,722 Updated last week
- Whisper realtime streaming for long speech-to-text transcription and translation ☆3,494 Updated last month
- OpenAI Whisper ASR Webservice API ☆3,078 Updated last month
- Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models! ☆2,843 Updated 4 months ago
- A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. … ☆624 Updated last year
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod… ☆995 Updated last week
- A simple GUI to use Whisper. ☆361 Updated 5 months ago
- Open Source project using LLMs to translate subtitles (SRT, SSA/ASS, VTT) ☆531 Updated last week
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper ☆2,108 Updated 2 months ago
- Faster Whisper transcription with CTranslate2 ☆19,670 Updated last month
- Real time transcription with OpenAI Whisper. ☆2,905 Updated 8 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper ☆5,279 Updated last month
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and … ☆407 Updated 2 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend. ☆839 Updated 10 months ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv… ☆2,188 Updated 5 months ago
- A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice,… ☆2,848 Updated last month
- The subtitles and translations are generated in real-time and displayed as pop-ups. ☆179 Updated 2 years ago
- Converts text to speech in realtime ☆3,683 Updated 5 months ago
- faster_whisper GUI with PySide6 ☆2,834 Updated last year
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ… ☆1,027 Updated 2 weeks ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence ☆2,706 Updated 3 months ago
- Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification) ☆1,676 Updated 2 weeks ago
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs ☆1,484 Updated 5 months ago
- Multi-backend whisper app. Blazing fast. Mac-arm optimized. Easy install. Input a local file or url and this service will transcribe it u… ☆1,171 Updated 2 months ago
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with…