madeyexz / whisper_subtitle
This project uses Whisper from OpenAI to generate video subtitles automatically.
☆18Updated last year
Related projects ⓘ
Alternatives and complementary repositories for whisper_subtitle
- ez audio transcription tool with flexible processing and post-processing options☆130Updated 9 months ago
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆49Updated 6 months ago
- Google Chrome SODA Offline Speech Recognition command line client☆150Updated last year
- web based editor for subtitles and transcripts☆112Updated 3 months ago
- A voice to text keyboard based on OpenAI Whisper Model.☆49Updated last year
- Offline voice input panel & keyboard with punctuation for Android.☆90Updated 5 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆75Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆64Updated this week
- Get news from foreign RSS feeds translated, summarized, and spoken to you daily.☆23Updated this week
- Whisper.cpp Speech-to-text with Voice Acticity Detection☆12Updated 2 weeks ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆178Updated 2 months ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated last year
- An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and input the recognized text; Supports English, Chinese, …☆43Updated this week
- a cross-platform and customizable vlc video player that can generate subtitles using WhisperX model☆9Updated last year
- Generate subtitles for long movies / podcasts with OpenAI Whisper API.☆27Updated last year
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…☆401Updated 3 weeks ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆40Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆16Updated last month
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆95Updated this week
- Synchronize Whisper's timestamps over an existing accurate transcription☆132Updated 5 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆53Updated 10 months ago
- A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.☆24Updated 9 months ago
- Download full or partial git-lfs repos without temporarily using 2x disk space☆30Updated last year
- AirLLM 70B inference with single 4GB GPU☆12Updated 3 months ago
- ☆20Updated 2 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆114Updated last year
- Modern GUI application that transcribes and translate audio files using OpenAI Whisper.☆119Updated 3 months ago
- A lightweight end-to-end text-to-speech model☆91Updated 2 months ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆18Updated last year