adueck / split-video-by-srtLinks
Script to split video files into chunks based on .srt timecodes
☆31Updated 7 years ago
Alternatives and similar repositories for split-video-by-srt
Users that are interested in split-video-by-srt are comparing it to the libraries listed below
Sorting:
- 📈 A forced aligner intended for synchronization of narrated text☆93Updated 2 years ago
- Postprocess SRT derived speech alignments for creating clean datasets for machine learning☆17Updated 2 years ago
- Desktop application for neural speech synthesis written in C++☆215Updated 2 years ago
- 🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.☆172Updated 5 years ago
- Python interface for forced audio alignment using HTK and SoX☆341Updated 5 years ago
- ☆63Updated 4 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆122Updated 7 months ago
- A collection of links and notes on forced alignment tools☆919Updated 3 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆361Updated last year
- A tool for automatic phoneme transcription☆157Updated 2 years ago
- A gui to help make a text to speech dataset.☆18Updated 2 years ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆641Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.☆257Updated 2 years ago
- Massively multilingual pronunciation mining☆344Updated last month
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆104Updated last year
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆243Updated 5 years ago
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech☆453Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆362Updated 2 years ago
- A python library to generate speech dataset from Youtube videos☆36Updated last year
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆351Updated 3 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 4 years ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆153Updated last year
- One Shot Voice Cloning base on Unet-TTS☆242Updated 3 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆44Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- DeepSpeech based forced alignment tool☆238Updated 4 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 3 years ago
- 🐸TTS recipes for different datasets☆86Updated 2 years ago