Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models and pyannote/nemo models in order to identify different speakers.
☆19Mar 10, 2023Updated 3 years ago
Alternatives and similar repositories for whisper_subtitler
Users that are interested in whisper_subtitler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE…☆11May 5, 2024Updated 2 years ago
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆17Aug 24, 2023Updated 2 years ago
- Transcribe with ease :D☆16Jun 21, 2023Updated 2 years ago
- ☆21Jun 8, 2023Updated 2 years ago
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews☆47Aug 6, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Browser extension to help users find and manage scholarships.☆19Feb 11, 2025Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆164May 28, 2024Updated last year
- Translate subtitle file to another language☆11May 5, 2024Updated 2 years ago
- A gradio interface for making transcribed and translated subtitles for videos☆43Feb 16, 2025Updated last year
- ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free Android Developers Speech Recognition API) then TRANSLATE (usin…☆18May 5, 2024Updated 2 years ago
- ☆12Mar 25, 2024Updated 2 years ago
- A weex-template support ios android and web. dev hot-reload & can generate html & px2rem & autoprefixer.☆15Feb 1, 2018Updated 8 years ago
- MCP server for transcript processing — formatting, contextual repair & smart summarization with deep-thinking LLMs☆19Apr 7, 2026Updated last month
- Connecting Alexa with ChatGPT via custom Alexa skill to have continuous conversation with memory.☆10Mar 30, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Capture system output audio in rust.☆12Dec 18, 2025Updated 4 months ago
- MeetNote2 - Zoom Auto-Recording & Transcription App☆14Jan 8, 2025Updated last year
- Real-time multi-person pose estimation☆22Oct 19, 2018Updated 7 years ago
- Speech Recognition and Simple AI Summary:可用于本地语音转文字、说话人分割及简易的AI总结,搭配web端操作界面。☆11Jul 22, 2024Updated last year
- A Python-based tool for downloading Spotify tracks and albums as MP3 files.☆10Nov 18, 2024Updated last year
- Jinja2 Template - AdminT Dashboard (Free Version) | AppSeed☆11May 17, 2021Updated 4 years ago
- visualize history. Nuxt + D3☆12Jun 20, 2018Updated 7 years ago
- ☆11Jul 27, 2021Updated 4 years ago
- An omnipowerful personal assistant powered by LLMs, Zapier NLA, and custom actions.☆15Sep 13, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆10Nov 24, 2024Updated last year
- ☆11Jun 24, 2022Updated 3 years ago
- 🌟[ https://geekyouth.github.io/static-website-demo/ ]纯静态网址导航站点💎💎💎☆15Sep 11, 2018Updated 7 years ago
- ☆11Mar 1, 2024Updated 2 years ago
- ☆14Aug 25, 2017Updated 8 years ago
- A collections of tools around sleep research: plotting of hypnograms / spectrograms, etc etc☆10Jan 24, 2026Updated 3 months ago
- Website for generating subtitles for videos using OpenAI's Whisper Models☆11Sep 1, 2024Updated last year
- Speech-To-Text Prompter, an extension for stable-diffusion-webui using the Whisper model☆11Mar 14, 2023Updated 3 years ago
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Sep 18, 2025Updated 7 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Suk is a remote desktop control kit based on WebRTC and Tauri.☆17Jan 29, 2023Updated 3 years ago
- MCP server that allows Claude to have a voice.☆13May 5, 2025Updated last year
- A Python neural network made with TensorFlow that converts one person's voice into another.☆10Jan 16, 2021Updated 5 years ago
- Transform youtube URL into text 100x faster with whisperx☆20May 8, 2023Updated 2 years ago
- PyQt(+PySide) Stable Diffusion GUI☆15Aug 1, 2023Updated 2 years ago
- ☆14Nov 28, 2022Updated 3 years ago
- Slidable Panel in Swift☆11Jul 18, 2020Updated 5 years ago