Timething is a library for aligning text transcripts with their audio recordings.
β130Dec 3, 2024Updated last year
Alternatives and similar repositories for timething
Users that are interested in timething are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Segment an audio file and obtain utterance alignments. (Python package)β346May 15, 2024Updated last year
- Coqui STT (πΈSTT) based forced alignment toolβ13Feb 24, 2022Updated 4 years ago
- π A forced aligner intended for synchronization of narrated textβ102Aug 9, 2025Updated 8 months ago
- Russian phonetical transcriptionβ11Nov 19, 2025Updated 4 months ago
- Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, spβ¦β440Sep 1, 2025Updated 7 months ago
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Synchronize Whisper's timestamps over an existing accurate transcriptionβ164May 28, 2024Updated last year
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)β2,818Jun 22, 2024Updated last year
- Forced alignment decoder for Whisper.β15Mar 13, 2024Updated 2 years ago
- Non-local Modeling for Image Quality Assessmentβ13Dec 20, 2023Updated 2 years ago
- β14Aug 19, 2024Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-β¦β41Sep 18, 2024Updated last year
- Generated Audio Samples by ALGAN-VC model are available in the folderβ19Feb 25, 2022Updated 4 years ago
- DeepSpeech based forced alignment toolβ239Dec 12, 2020Updated 5 years ago
- Text to speech alignment using CTC forced alignmentβ478Feb 23, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used toβ¦β36Jan 16, 2021Updated 5 years ago
- A collection of links and notes on forced alignment toolsβ938Nov 10, 2021Updated 4 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)β20Nov 14, 2019Updated 6 years ago
- Python forced alignmentβ95Apr 12, 2024Updated last year
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR modelsβ¦β19Mar 10, 2023Updated 3 years ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS modelsβ176Dec 18, 2023Updated 2 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) β python package for placing stress in Russian text using RNN (BiLSTβ¦β45Aug 7, 2024Updated last year
- Performant and accurate speech recognition built on Pytorchβ254May 19, 2022Updated 3 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesisβ45Jul 24, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Deep Visual Speech Recognition in arabic wordsβ16Oct 18, 2023Updated 2 years ago
- β18Sep 19, 2023Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.β25Aug 1, 2025Updated 8 months ago
- Deep Learning model for lexical stress detection in spoken Englishβ29Mar 17, 2020Updated 6 years ago
- β11Mar 28, 2024Updated 2 years ago
- β144Sep 8, 2025Updated 7 months ago
- Use spaCy for NLP and output to the FoLiA XML format.β12Feb 27, 2024Updated 2 years ago
- Description This project aims to create a system for a car rental company that can help them keep track of car rentals and manage them.β10Aug 23, 2022Updated 3 years ago
- The Heracles framework for developing and evaluating text mining algorithmsβ10Jul 1, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Binary builds of FFmpeg for the PyAV projectβ21Mar 24, 2026Updated 2 weeks ago
- Wav2Lip model Windows GUI Program using PyQT5β19Jun 4, 2021Updated 4 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event β¦β412Feb 21, 2024Updated 2 years ago
- β58Feb 8, 2026Updated 2 months ago
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singinβ¦β41Aug 18, 2024Updated last year
- Code for "A Dependency Syntactic Knowledge Augmented Interactive Architecture for End-to-End Aspect-based Sentiment Analysis" on Neurocomβ¦β17May 19, 2021Updated 4 years ago
- β32Jan 6, 2022Updated 4 years ago