jhj0517 / Whisper-WebUI
A Web UI for easy subtitle generation using the Whisper model.
★1,800 · Updated last week
Alternatives and similar repositories for Whisper-WebUI:
Users who are interested in Whisper-WebUI are comparing it to the libraries listed below.
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python. ★1,808 · Updated last month
- Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants ★1,432 · Updated this week
- OpenAI Whisper ASR Webservice API ★2,453 · Updated last month
- Whisper command line client compatible with original OpenAI client based on CTranslate2. ★993 · Updated last month
- A nearly-live implementation of OpenAI's Whisper. ★2,600 · Updated 3 weeks ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence ★2,313 · Updated last month
- ★1,582 · Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translation ★2,633 · Updated 2 months ago
- Faster Whisper transcription with CTranslate2 ★14,882 · Updated 2 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend. ★718 · Updated last month
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper ★1,794 · Updated last week
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ★4,571 · Updated 11 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ★14,557 · Updated 2 weeks ago
- TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5,… ★2,066 · Updated this week
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper ★4,273 · Updated last week
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod… ★459 · Updated this week
- turnkey self-hosted offline transcription and diarization service with llm summary ★825 · Updated 5 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. ★3,793 · Updated 2 months ago
- a gradio webui for faster whisper ★255 · Updated last year
- Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models! ★2,096 · Updated last month
- Open Source project using LLMs to translate SRT subtitles ★434 · Updated 2 weeks ago
- A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. … ★561 · Updated last year
- Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and do… ★783 · Updated 2 years ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv… ★1,619 · Updated this week
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and … ★244 · Updated this week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector ★5,310 · Updated last month
- ★570 · Updated 10 months ago
- ML-powered speech recognition directly in your browser ★2,854 · Updated 5 months ago
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs ★618 · Updated this week
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with… ★3,493 · Updated this week