jhj0517 / Whisper-WebUILinks
A Web UI for easy subtitle using whisper model.
β2,496Updated last month
Alternatives and similar repositories for Whisper-WebUI
Users that are interested in Whisper-WebUI are comparing it to the libraries listed below
Sorting:
- ποΈ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants ποΈβ1,582Updated 2 months ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.β1,140Updated 2 weeks ago
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.β2,654Updated last week
- β2,591Updated this week
- OpenAI Whisper ASR Webservice APIβ3,012Updated 4 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisperβ5,122Updated last month
- Open Source project using LLMs to translate subtitles (SRT, SSA/ASS, VTT)β516Updated last month
- A nearly-live implementation of OpenAI's Whisper.β3,578Updated last month
- Batch speech to text using OpenAI's whisper.β302Updated 7 months ago
- Transcription, forced alignment, and audio indexing with OpenAI's Whisperβ2,068Updated 2 weeks ago
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabsβ1,380Updated 4 months ago
- A simple GUI to use Whisper.β258Updated 3 months ago
- A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. β¦β617Updated last year
- Multilingual Automatic Speech Recognition with word-level timestamps and confidenceβ2,657Updated 2 months ago
- Whisper realtime streaming for long speech-to-text transcription and translationβ3,440Updated this week
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning modβ¦β971Updated last month
- Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!β2,783Updated 3 months ago
- A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice,β¦β2,726Updated last week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β18,684Updated 3 weeks ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and β¦β385Updated 3 weeks ago
- Multi-backend whisper app. Blazing fast. Mac-arm optimized. Easy install. Input a local file or url and this service will transcribe it uβ¦β1,139Updated last month
- ML-powered speech recognition directly in your browserβ3,146Updated last year
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advβ¦β2,123Updated 4 months ago
- Automatically generate and overlay subtitles for any video.β2,091Updated last year
- Faster Whisper transcription with CTranslate2β18,962Updated 2 weeks ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.β830Updated 9 months ago
- TTS with kokoro and onnx runtimeβ2,255Updated 4 months ago
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includβ¦β899Updated 2 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,979Updated 10 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ865Updated 5 months ago