adueck / split-video-by-srt
Script to split video files into chunks based on .srt timecodes
☆31 Updated 7 years ago
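As an illustration of what "splitting by .srt timecodes" involves, here is a minimal Python sketch. It is not the repository's actual script: the function names, the output file naming, and the choice of `ffmpeg` with `-ss`/`-t` are all assumptions made for this example. It parses the timing lines of an SRT file and builds one `ffmpeg` argument list per subtitle cue, without running anything.

```python
import re

# SRT timing line, e.g. "00:01:02,500 --> 00:01:05,000"
TIMING = re.compile(
    r"(\d+):(\d{2}):(\d{2})[,.](\d{3})\s*-->\s*(\d+):(\d{2}):(\d{2})[,.](\d{3})"
)


def to_seconds(h, m, s, ms):
    """Convert hour/minute/second/millisecond strings to seconds."""
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000


def parse_srt_times(srt_text):
    """Return a list of (start, end) pairs in seconds, one per cue."""
    cues = []
    for match in TIMING.finditer(srt_text):
        g = match.groups()
        cues.append((to_seconds(*g[:4]), to_seconds(*g[4:])))
    return cues


def ffmpeg_commands(video_path, cues, prefix="chunk"):
    """Build one ffmpeg argument list per cue.

    The output naming (prefix_0001.mp4, ...) is hypothetical.
    """
    cmds = []
    for i, (start, end) in enumerate(cues, 1):
        cmds.append([
            "ffmpeg", "-i", video_path,
            "-ss", f"{start:.3f}",        # seek to the cue start
            "-t", f"{end - start:.3f}",   # keep only the cue duration
            f"{prefix}_{i:04d}.mp4",
        ])
    return cmds
```

Each returned list could be passed to `subprocess.run(cmd, check=True)` if `ffmpeg` is installed. Building the commands separately from running them keeps the parsing step easy to inspect.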
Alternatives and similar repositories for split-video-by-srt:
Users who are interested in split-video-by-srt are comparing it to the libraries listed below.
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆27 Updated 2 years ago
- Community framework for training tortoise ☆41 Updated 2 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV) ☆20 Updated 5 years ago
- A python library to generate speech dataset from Youtube videos ☆36 Updated 9 months ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p… ☆53 Updated 2 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines … ☆59 Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆146 Updated 10 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆142 Updated 9 months ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆127 Updated last year
- Tools to create your own voice dataset for TTS training ☆66 Updated 4 years ago
- 📈 A forced aligner intended for synchronization of narrated text ☆91 Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆132 Updated 11 months ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS ☆126 Updated 2 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl… ☆159 Updated 3 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow ☆129 Updated 3 years ago
- Interface for Controllable Expressive Talking Machine ☆38 Updated last year
- A python package for deep multilingual punctuation prediction. ☆119 Updated 7 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆109 Updated 2 years ago
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆124 Updated 4 months ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ☆25 Updated 2 years ago
- One Shot Voice Cloning based on Unet-TTS ☆241 Updated 3 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language