adueck / split-video-by-srt
Script to split video files into chunks based on .srt timecodes
☆31Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for split-video-by-srt
- A python library to generate speech dataset from Youtube videos☆35Updated 5 months ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆43Updated 4 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆50Updated 2 years ago
- One-button-press forced aligner for Japanese, using Julius.☆44Updated last year
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 3 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 2 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆235Updated 5 years ago
- A python package for deep multilingual punctuation prediction.☆98Updated 3 months ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆130Updated 7 months ago
- ☆18Updated 2 years ago
- Multilingual Grapheme to Phoneme☆49Updated 8 years ago
- A gui to help make a text to speech dataset.☆18Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆99Updated last year
- ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.☆68Updated 2 years ago
- Postprocess SRT derived speech alignments for creating clean datasets for machine learning☆17Updated last year
- Interface for Controllable Expressive Talking Machine☆38Updated 10 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆37Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020☆265Updated 2 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated last year
- Collect Voice Conversion researches☆90Updated this week
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆54Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆141Updated 6 months ago
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK☆58Updated 7 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.☆237Updated last year
- My guide to create an italian TTS with Coqui☆14Updated 2 years ago
- ☆64Updated 3 years ago
- Python forced alignment☆73Updated 7 months ago
- ☆77Updated 5 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year