adueck / split-video-by-srt
Script to split video files into chunks based on .srt timecodes
☆32Updated 7 years ago
Alternatives and similar repositories for split-video-by-srt
Users who are interested in split-video-by-srt compare it to the libraries listed below
- 📈 A forced aligner intended for synchronization of narrated text☆100Updated 3 months ago
- Timething is a library for aligning text transcripts with their audio recordings.☆126Updated 11 months ago
- 🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.☆171Updated 6 years ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆44Updated 5 years ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆160Updated last year
- Desktop application for neural speech synthesis written in C++☆214Updated 2 years ago
- One Shot Voice Cloning based on Unet-TTS☆243Updated 3 years ago
- A python package for deep multilingual punctuation prediction.☆151Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆360Updated 2 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.☆265Updated 3 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Community framework for training tortoise☆44Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆153Updated last year
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆107Updated last year
- ☆63Updated 4 years ago
- DeepSpeech based forced alignment tool☆239Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- IPA Pronunciation Dictionaries in DSL format☆44Updated 8 years ago
- Free Dutch voice dataset☆13Updated 4 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago
- Model for recasing and repunctuating ASR transcripts☆142Updated last year
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆62Updated 4 years ago
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech☆457Updated last year
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.☆52Updated last year
- Postprocess SRT derived speech alignments for creating clean datasets for machine learning☆17Updated 2 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆40Updated 3 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆224Updated last year
- ☆17Updated 3 years ago
- Multilingual sentence alignment using sentence embeddings☆130Updated last year
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.☆36Updated last year