saurabhshri / CCAligner
๐ฎ Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
โ170Updated 5 years ago
Alternatives and similar repositories for CCAligner:
Users that are interested in CCAligner are comparing it to the libraries listed below
- DeepSpeech based forced alignment toolโ237Updated 4 years ago
- A collection of links and notes on forced alignment toolsโ902Updated 3 years ago
- A node module to generate subtitles by segmenting a list of time-coded text - BBC News Labsโ49Updated last year
- ๐ A forced aligner intended for synchronization of narrated textโ91Updated 2 years ago
- Python interface for forced audio alignment using HTK and SoXโ337Updated 4 years ago
- Timething is a library for aligning text transcripts with their audio recordings.โ117Updated 4 months ago
- Synchronize Whisper's timestamps over an existing accurate transcriptionโ145Updated 10 months ago
- An audio/acoustic activity detection and audio segmentation toolโ771Updated 4 months ago
- An HTML interface for finetuning the sync map output from aeneasโ53Updated 2 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.โ246Updated 2 years ago
- Script to split video files into chunks based on .srt timecodesโ31Updated 7 years ago
- Automated lyrics-to-audio alignment using syllabic nuclei detection. Developed during Google Summer of Code 2019.โ50Updated last year
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting โฆโ326Updated last year
- A tool for automatic phoneme transcriptionโ157Updated last year
- โ80Updated last year
- Package for aligning audio files through audio fingerprintingโ109Updated last month
- Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structurโฆโ91Updated 7 years ago
- Command line utility for forced alignment using Kaldiโ1,442Updated 3 weeks ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesโ470Updated 5 years ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languagesโ618Updated 11 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.โ309Updated 5 months ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender โฆโ797Updated 3 months ago
- g2p: English Grapheme To Phoneme Conversionโ848Updated 2 years ago
- DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.โ356Updated 4 years ago
- Synchronize your subtitles using machine learningโ152Updated last year
- End-2-end speech synthesis with recurrent neural networksโ226Updated last year
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognitionโฆโ98Updated 3 years ago
- Phonetisaurus G2Pโ469Updated 10 months ago
- Subtitles as a language learning toolโ69Updated last year
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)โ701Updated 3 weeks ago