Timething is a library for aligning text transcripts with their audio recordings.
☆130 · Dec 3, 2024 · Updated last year
Alternatives and similar repositories for timething
Users who are interested in timething are comparing it to the libraries listed below.
- StyleTTS2 + Vocos as a Decoder ☆13 · Mar 24, 2025 · Updated 11 months ago
- Coqui STT (🐸STT) based forced alignment tool ☆13 · Feb 24, 2022 · Updated 4 years ago
- ☆14 · Aug 19, 2024 · Updated last year
- Segment an audio file and obtain utterance alignments. (Python package) ☆345 · May 15, 2024 · Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription ☆162 · May 28, 2024 · Updated last year
- Generated Audio Samples by ALGAN-VC model are available in the folder ☆19 · Feb 25, 2022 · Updated 4 years ago
- Text to speech alignment using CTC forced alignment ☆443 · Updated this week
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV) ☆20 · Nov 14, 2019 · Updated 6 years ago
- ☆18 · Sep 19, 2023 · Updated 2 years ago
- A collection of links and notes on forced alignment tools ☆935 · Nov 10, 2021 · Updated 4 years ago
- Use a video and cut out portions of it without re-mounting the video inbetween. ☆15 · Sep 23, 2024 · Updated last year
- ☆11 · Mar 28, 2024 · Updated last year
- Non-local Modeling for Image Quality Assessment ☆13 · Dec 20, 2023 · Updated 2 years ago
- ☆11 · Dec 22, 2020 · Updated 5 years ago
- ☆13 · Apr 9, 2021 · Updated 4 years ago
- The Heracles framework for developing and evaluating text mining algorithms ☆10 · Jul 1, 2022 · Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆16 · Dec 3, 2024 · Updated last year
- DeepSpeech based forced alignment tool ☆239 · Dec 12, 2020 · Updated 5 years ago
- Deep Learning model for lexical stress detection in spoken English ☆29 · Mar 17, 2020 · Updated 5 years ago
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment) ☆2,806 · Jun 22, 2024 · Updated last year
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 · Feb 27, 2024 · Updated 2 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-… ☆40 · Sep 18, 2024 · Updated last year
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models ☆175 · Dec 18, 2023 · Updated 2 years ago
- ☆15 · Jan 11, 2024 · Updated 2 years ago
- MCP server to expose local zotero repository to MCP clients ☆23 · Jun 4, 2025 · Updated 8 months ago
- Forced alignment decoder for Whisper. ☆14 · Mar 13, 2024 · Updated last year
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP ☆14 · Mar 24, 2021 · Updated 4 years ago
- Cantonese Text to Speech with VITS implementation ☆37 · Apr 8, 2023 · Updated 2 years ago
- Python forced alignment ☆95 · Apr 12, 2024 · Updated last year
- Performant and accurate speech recognition built on Pytorch ☆254 · May 19, 2022 · Updated 3 years ago
- Music Modeling Kit ☆22 · Jan 10, 2025 · Updated last year
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform ☆243 · Jan 14, 2025 · Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ☆24 · Aug 1, 2025 · Updated 7 months ago
- Code for "A Dependency Syntactic Knowledge Augmented Interactive Architecture for End-to-End Aspect-based Sentiment Analysis" on Neurocom… ☆17 · May 19, 2021 · Updated 4 years ago
- Symbolic music generation taking inspiration from NLP and human composition process ☆18 · Jun 28, 2023 · Updated 2 years ago
- Utilities for patterns and globs for WebExtensions ☆23 · Jul 3, 2025 · Updated 7 months ago
- Tools to create your own voice dataset for TTS training ☆70 · Oct 26, 2020 · Updated 5 years ago
- ☆57 · Feb 8, 2026 · Updated 2 weeks ago
- REPeating Pattern Extraction Technique (REPET) in Matlab for audio source separation: original REPET, REPET extended, adaptive REPET, REP… ☆38 · Feb 16, 2024 · Updated 2 years ago