Timething is a library for aligning text transcripts with their audio recordings.
β130Dec 3, 2024Updated last year
Alternatives and similar repositories for timething
Users that are interested in timething are comparing it to the libraries listed below
Sorting:
- Segment an audio file and obtain utterance alignments. (Python package)β346May 15, 2024Updated last year
- Coqui STT (πΈSTT) based forced alignment toolβ13Feb 24, 2022Updated 4 years ago
- Russian phonetical transcriptionβ11Nov 19, 2025Updated 4 months ago
- Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, spβ¦β440Sep 1, 2025Updated 6 months ago
- Synchronize Whisper's timestamps over an existing accurate transcriptionβ163May 28, 2024Updated last year
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)β2,811Jun 22, 2024Updated last year
- Forced alignment decoder for Whisper.β15Mar 13, 2024Updated 2 years ago
- Non-local Modeling for Image Quality Assessmentβ13Dec 20, 2023Updated 2 years ago
- β14Aug 19, 2024Updated last year
- Forced alignment for karaokesβ18Updated this week
- Text to speech alignment using CTC forced alignmentβ463Feb 23, 2026Updated 3 weeks ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-β¦β40Sep 18, 2024Updated last year
- Generated Audio Samples by ALGAN-VC model are available in the folderβ19Feb 25, 2022Updated 4 years ago
- DeepSpeech based forced alignment toolβ239Dec 12, 2020Updated 5 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used toβ¦β36Jan 16, 2021Updated 5 years ago
- A collection of links and notes on forced alignment toolsβ936Nov 10, 2021Updated 4 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)β20Nov 14, 2019Updated 6 years ago
- Python forced alignmentβ95Apr 12, 2024Updated last year
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS modelsβ176Dec 18, 2023Updated 2 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) β python package for placing stress in Russian text using RNN (BiLSTβ¦β45Aug 7, 2024Updated last year
- Performant and accurate speech recognition built on Pytorchβ254May 19, 2022Updated 3 years ago
- β18Sep 19, 2023Updated 2 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesisβ44Jul 24, 2023Updated 2 years ago
- Deep Visual Speech Recognition in arabic wordsβ16Oct 18, 2023Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.β24Aug 1, 2025Updated 7 months ago
- Deep Learning model for lexical stress detection in spoken Englishβ29Mar 17, 2020Updated 6 years ago
- β11Mar 28, 2024Updated last year
- Use spaCy for NLP and output to the FoLiA XML format.β12Feb 27, 2024Updated 2 years ago
- Description This project aims to create a system for a car rental company that can help them keep track of car rentals and manage them.β10Aug 23, 2022Updated 3 years ago
- The Heracles framework for developing and evaluating text mining algorithmsβ10Jul 1, 2022Updated 3 years ago
- My Letter Beautiful Mysterious Notebook.β13Oct 20, 2021Updated 4 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event β¦β412Feb 21, 2024Updated 2 years ago
- β58Feb 8, 2026Updated last month
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singinβ¦β40Aug 18, 2024Updated last year
- β32Jan 6, 2022Updated 4 years ago
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transformβ247Jan 14, 2025Updated last year
- Code for "A Dependency Syntactic Knowledge Augmented Interactive Architecture for End-to-End Aspect-based Sentiment Analysis" on Neurocomβ¦β17May 19, 2021Updated 4 years ago
- Tool to make high quality text to speech (tts) corpus from audio + text books.β28Jul 31, 2025Updated 7 months ago
- Transcription, forced alignment, and audio indexing with OpenAI's Whisperβ2,181Oct 29, 2025Updated 4 months ago