Create YouTube SRT with WhisperX using Google Colab
☆21 · May 16, 2023 · Updated 2 years ago
Alternatives and similar repositories for WhisperX-Youtube-SRT
Users that are interested in WhisperX-Youtube-SRT are comparing it to the libraries listed below
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics ☆15 · Oct 28, 2024 · Updated last year
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models… ☆19 · Mar 10, 2023 · Updated 3 years ago
- colab list for video ☆10 · Jan 29, 2026 · Updated last month
- WanImageToVideo ComfyUI node, with Tiled VAE ☆15 · Oct 22, 2025 · Updated 4 months ago
- STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation ☆73 · Nov 11, 2025 · Updated 3 months ago
- A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE… ☆11 · May 5, 2024 · Updated last year
- A Python-based tool for downloading Spotify tracks and albums as MP3 files. ☆10 · Nov 18, 2024 · Updated last year
- Movie website with real film data ☆11 · Sep 24, 2022 · Updated 3 years ago
- ☆10 · Oct 20, 2022 · Updated 3 years ago
- ☆23 · Jan 1, 2026 · Updated 2 months ago
- ☆14 · May 11, 2023 · Updated 2 years ago
- ☆12 · May 30, 2025 · Updated 9 months ago
- colab list for image ☆19 · Nov 18, 2025 · Updated 3 months ago
- Command-line interface wrapper for https://cobalt.tools, written in rust ☆13 · Oct 19, 2024 · Updated last year
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD ☆10 · Mar 31, 2021 · Updated 4 years ago
- A tool for grabbing free proxies, with automatic live checking ☆11 · Jun 16, 2023 · Updated 2 years ago
- On the road to growing as a developer ☆10 · Dec 25, 2018 · Updated 7 years ago
- Huggingface Backup - Jupyter, Colab and Python Script ☆10 · Jan 20, 2026 · Updated last month
- ☆11 · Nov 2, 2024 · Updated last year
- LLM-based character segmentation agent for ComfyUI based on SAM 3 and the SAM 3 Agent notebook ☆25 · Dec 22, 2025 · Updated 2 months ago
- Extension for Forge-based UIs (Forge, reForge, etc) and ComfyUI to replace CFG with Negative Rejection Steering ☆16 · Feb 14, 2026 · Updated 3 weeks ago
- ☆10 · May 25, 2021 · Updated 4 years ago
- SimplifiedTransformer simplifies transformer block without affecting training. Skip connections, projection parameters, sequential sub-bl… ☆15 · Feb 6, 2026 · Updated last month
- Widevine L3 CDM Give away ☆11 · Jan 12, 2023 · Updated 3 years ago
- Automatic audio transcription to .srt using Google's Speech to Text API ☆12 · Oct 26, 2020 · Updated 5 years ago
- Convert Niconico live comments, as well as YouTube and Twitch comments, for playback on old and new Niconico video tools. ☆12 · Mar 26, 2025 · Updated 11 months ago
- Widevine L3 CDM Give away ☆14 · Jan 12, 2023 · Updated 3 years ago
- This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea… ☆11 · Sep 27, 2022 · Updated 3 years ago
- TLBVFI Video Frame Interpolation for ComfyUI ☆22 · Feb 18, 2026 · Updated 2 weeks ago
- Variable Bitrate Residual Vector Quantization for Audio Coding ☆51 · May 1, 2025 · Updated 10 months ago
- VSFilterMod with VapourSynth interface added ☆43 · Dec 5, 2023 · Updated 2 years ago
- Dataset simulation for DPCCN. ☆16 · Dec 25, 2022 · Updated 3 years ago
- Speech-To-Text Prompter, an extension for stable-diffusion-webui using the Whisper model ☆11 · Mar 14, 2023 · Updated 2 years ago
- SinGlow is a part of my singing voice synthesis system. It can extract features of sound, particularly songs and music. Then we can use … ☆11 · Oct 9, 2021 · Updated 4 years ago
- A collection of macros and modules that seek to add some Illustrator like functionality to Aegisub ☆15 · Updated this week
- ☆16 · Jun 15, 2022 · Updated 3 years ago
- An adapter layer that ensures torch_musa🔦 delivers a CUDA-compatible PyTorch experience. ☆29 · Updated this week
- This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets re… ☆13 · Oct 8, 2025 · Updated 5 months ago
- Auto video maker handling many AIs ☆11 · Mar 18, 2024 · Updated last year