jhj0517 / Whisper-WebUI
A Web UI for easy subtitle generation using the Whisper model.
☆1,700 Updated last week
Alternatives and similar repositories for Whisper-WebUI:
Users who are interested in Whisper-WebUI are comparing it to the repositories listed below:
- Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants ☆1,386 Updated 3 weeks ago
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper ☆1,749 Updated 2 weeks ago
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python. ☆1,640 Updated 2 weeks ago
- Open Source project using LLMs to translate SRT subtitles ☆407 Updated this week
- Whisper command line client compatible with original OpenAI client based on CTranslate2. ☆983 Updated last week
- Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voic… ☆3,273 Updated 2 weeks ago
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod… ☆433 Updated last week
- OpenAI Whisper ASR Webservice API ☆2,330 Updated this week
- Converts text to speech in realtime ☆2,533 Updated this week
- A nearly-live implementation of OpenAI's Whisper. ☆2,448 Updated 2 weeks ago
- ☆1,384 Updated this week
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs ☆498 Updated 3 weeks ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend. ☆689 Updated 2 weeks ago
- Efficient translation tool based on ChatGPT or any OpenAI compatible LLM chat completion API ☆280 Updated 3 weeks ago
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence ☆2,244 Updated last week
- Whisper realtime streaming for long speech-to-text transcription and translation ☆2,471 Updated last month
- WhisperPlus: Faster, Smarter, and More Capable ☆1,788 Updated 2 weeks ago
- Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models! ☆1,981 Updated 3 weeks ago
- https://hf.co/hexgrad/Kokoro-82M ☆1,119 Updated this week
- ☆559 Updated 9 months ago
- Batch speech to text using OpenAI's whisper. ☆278 Updated 2 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆584 Updated 2 months ago
- ML-powered speech recognition directly in your browser ☆2,787 Updated 4 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech ☆314 Updated 2 months ago
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha… ☆414 Updated last week
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri… ☆5,920 Updated this week
- Controllable and fast Text-to-Speech for over 7000 languages! ☆1,545 Updated 3 months ago
- AI powered speech denoising and enhancement ☆1,641 Updated 2 months ago
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching ☆1,604 Updated last week
- Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses l… ☆408 Updated last week