adueck / split-video-by-srt
Script to split video files into chunks based on .srt timecodes
☆31Updated 7 years ago
Alternatives and similar repositories for split-video-by-srt
Users that are interested in split-video-by-srt are comparing it to the libraries listed below
Sorting:
- A gui to help make a text to speech dataset.☆18Updated 2 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆53Updated 3 years ago
- Tools to create your own voice dataset for TTS training☆66Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- A python library to generate speech dataset from Youtube videos☆36Updated 11 months ago
- 📈 A forced aligner intended for synchronization of narrated text☆93Updated 2 years ago
- ☆17Updated 2 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆34Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 4 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Updated 5 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 2 years ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆91Updated last year
- Community framework for training tortoise☆41Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- A public domain single speaker Japanese speech dataset☆53Updated last year
- Postprocess SRT derived speech alignments for creating clean datasets for machine learning☆17Updated 2 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆160Updated last year
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- IPA Pronunciation Dictionaries in DSL format☆40Updated 8 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆243Updated 5 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- Grapheme To Phoneme☆73Updated 9 months ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆23Updated 3 months ago
- A python package for deep multilingual punctuation prediction.☆123Updated 8 months ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆128Updated 2 years ago
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments☆43Updated 4 years ago
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- [WIP] VoiceSmith makes training text to speech models easy.☆224Updated 2 years ago