jhj0517 / Whisper-WebUILinks
A Web UI for easy subtitle using whisper model.
β2,138Updated 2 months ago
Alternatives and similar repositories for Whisper-WebUI
Users that are interested in Whisper-WebUI are comparing it to the libraries listed below
Sorting:
- Whisper command line client compatible with original OpenAI client based on CTranslate2.β1,067Updated last month
- ποΈ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants ποΈβ1,519Updated last month
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.β2,240Updated 2 months ago
- A nearly-live implementation of OpenAI's Whisper.β3,080Updated last week
- β2,083Updated this week
- A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. β¦β594Updated last year
- Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!β2,422Updated last week
- Open Source project using LLMs to translate SRT subtitlesβ463Updated last month
- Transcription, forced alignment, and audio indexing with OpenAI's Whisperβ1,936Updated last month
- OpenAI Whisper ASR Webservice APIβ2,722Updated last week
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning modβ¦β811Updated last month
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.β791Updated 5 months ago
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitchingβ3,197Updated last week
- Faster Whisper transcription with CTranslate2β16,978Updated last month
- A simple GUI to use Whisper.β173Updated last week
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and β¦β326Updated last week
- A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice,β¦β2,338Updated this week
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advβ¦β1,898Updated last month
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includβ¦β565Updated last week
- Whisper realtime streaming for long speech-to-text transcription and translationβ3,096Updated 2 weeks ago
- An Open Source text-to-speech system built by inverting Whisper.β4,308Updated last month
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabsβ978Updated last week
- Multilingual Automatic Speech Recognition with word-level timestamps and confidenceβ2,500Updated 3 months ago
- Batch speech to text using OpenAI's whisper.β298Updated 3 months ago
- turnkey self-hosted offline transcription and diarization service with llm summaryβ866Updated 9 months ago
- Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)β277Updated this week
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisperβ4,712Updated 2 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.β169Updated 2 years ago
- a gradio webui for faster whisperβ268Updated 2 years ago
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), withβ¦β4,304Updated last month