glowinthedark / subtitles-ocrLinks
Hard-burned subtitles OCR to SRT extractor
☆20Updated last year
Alternatives and similar repositories for subtitles-ocr
Users that are interested in subtitles-ocr are comparing it to the libraries listed below
Sorting:
- Extract hardcoded subtitles from videos using machine learning☆180Updated last week
- Convert bitmap subtitles into SubRip format using the macOS Vision framework☆28Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- Uses machine learning to denoise audio containing speech☆34Updated 11 months ago
- A deep learning lyrics-to-audio alignment system, generating synchronized lyrics from a song and its lyrics☆42Updated 2 years ago
- A library for parsing and manipulating Advanced SubStation Alpha subtitle files.☆43Updated last year
- Translate a .SRT file using DeepL and Selenium☆60Updated 2 years ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆112Updated this week
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆20Updated last month
- scrape subtitles from opensubtitles.org☆35Updated last month
- Accelerating faster-whisper single file processing by multiprocessing through parallelization☆54Updated 2 years ago
- SRT files translator☆236Updated 7 months ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Updated 5 years ago
- Automation package for everything related to encoding and subbing☆20Updated 2 weeks ago
- A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.☆29Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- Advanced delogo plugin for AviSynth+☆90Updated 2 months ago
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆13Updated 6 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆152Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆119Updated 6 months ago
- ☆48Updated 3 years ago
- Gimp plugins to extract text from images (Bubble/Balloons)☆12Updated 11 months ago
- OpenAI Whisper Prompt Examples☆52Updated last year
- A simple script to prepare dataset for training with TTS Tortoise model via https://git.ecker.tech/mrq/ai-voice-cloning☆12Updated last year
- Chrome extension for loading subtitles from Netflix web series.☆39Updated 5 years ago
- A python script that takes an input MP3/FLAC and outputs an acapella/background noise stripped WAV using the power of NVIDIA's RTX Voice☆89Updated 3 months ago
- ☆20Updated 7 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆52Updated 2 years ago
- 📈 A forced aligner intended for synchronization of narrated text☆93Updated 2 years ago
- On-device noise suppression powered by deep learning☆70Updated last month