adueck / split-video-by-srtLinks
Script to split video files into chunks based on .srt timecodes
โ32Updated 7 years ago
Alternatives and similar repositories for split-video-by-srt
Users that are interested in split-video-by-srt are comparing it to the libraries listed below
Sorting:
- ๐ A forced aligner intended for synchronization of narrated textโ95Updated 2 years ago
- Timething is a library for aligning text transcripts with their audio recordings.โ122Updated 8 months ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabetโฆโ44Updated 5 years ago
- โ63Updated 4 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to pโฆโ52Updated 3 years ago
- Synchronize Whisper's timestamps over an existing accurate transcriptionโ154Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.โ361Updated 2 years ago
- A python library to generate speech dataset from Youtube videosโ36Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeโ149Updated last year
- One Shot Voice Cloning base on Unet-TTSโ241Updated 3 years ago
- Desktop application for neural speech synthesis written in C++โ215Updated 2 years ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3โ104Updated last year
- ๐ฎ Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.โ172Updated 5 years ago
- โ130Updated 2 years ago
- A python package for deep multilingual punctuation prediction.โ128Updated 11 months ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)โ20Updated 5 years ago
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speechโ455Updated last year
- DeepSpeech based forced alignment toolโ238Updated 4 years ago
- ๐ธTTS recipes for different datasetsโ86Updated 3 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogramโ253Updated last year
- IPA Pronunciation Dictionaries in DSL formatโ40Updated 8 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglowโ129Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperโ116Updated 2 years ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differenโฆโ239Updated 3 years ago
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.โ36Updated last year
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.โ259Updated 2 years ago
- Tools to create your own voice dataset for TTS trainingโ67Updated 4 years ago
- ๐ Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. ๐ง๐ฅ๐ Advanced audio processing.โ250Updated last year
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!โ351Updated 3 years ago
- Learning Lip Sync of Obama from Speech Audioโ66Updated 5 years ago