Timething is a library for aligning text transcripts with their audio recordings.
☆131 · Dec 3, 2024 · Updated last year
Alternatives and similar repositories for timething
Users who are interested in timething are comparing it to the libraries listed below.
- Segment an audio file and obtain utterance alignments. (Python package) ☆347 · May 15, 2024 · Updated 2 years ago
- StyleTTS2 + Vocos as a Decoder ☆13 · Mar 24, 2025 · Updated last year
- Coqui STT (🐸STT) based forced alignment tool ☆13 · Feb 24, 2022 · Updated 4 years ago
- Russian phonetical transcription ☆11 · May 20, 2026 · Updated last month
- Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, sp… ☆446 · Apr 21, 2026 · Updated 2 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆165 · May 28, 2024 · Updated 2 years ago
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment) ☆2,846 · Jun 22, 2024 · Updated 2 years ago
- Forced alignment decoder for Whisper. ☆16 · Mar 13, 2024 · Updated 2 years ago
- Non-local Modeling for Image Quality Assessment ☆13 · Dec 20, 2023 · Updated 2 years ago
- ☆14 · Aug 19, 2024 · Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-… ☆43 · Sep 18, 2024 · Updated last year
- Generated Audio Samples by ALGAN-VC model are available in the folder ☆19 · Feb 25, 2022 · Updated 4 years ago
- Text to speech alignment using CTC forced alignment ☆510 · Apr 15, 2026 · Updated 2 months ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to… ☆39 · Jan 16, 2021 · Updated 5 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV) ☆20 · Nov 14, 2019 · Updated 6 years ago
- Python forced alignment ☆95 · Apr 12, 2024 · Updated 2 years ago
- Forced alignment for karaokes ☆24 · Updated this week
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models… ☆19 · Mar 10, 2023 · Updated 3 years ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models ☆175 · Dec 18, 2023 · Updated 2 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — Python package for placing stress in Russian text using RNN (BiLST… ☆45 · Aug 7, 2024 · Updated last year
- Performant and accurate speech recognition built on PyTorch ☆254 · May 19, 2022 · Updated 4 years ago
- MCP server to expose local Zotero repository to MCP clients ☆29 · Jun 4, 2025 · Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis ☆45 · Jul 24, 2023 · Updated 2 years ago
- Deep Visual Speech Recognition in Arabic words ☆16 · Oct 18, 2023 · Updated 2 years ago
- ☆18 · Sep 19, 2023 · Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ☆25 · Aug 1, 2025 · Updated 10 months ago
- Deep Learning model for lexical stress detection in spoken English ☆28 · Mar 17, 2020 · Updated 6 years ago
- ☆12 · Mar 28, 2024 · Updated 2 years ago
- ☆155 · Sep 8, 2025 · Updated 9 months ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 · Feb 27, 2024 · Updated 2 years ago
- Description This project aims to create a system for a car rental company that can help them keep track of car rentals and manage them. ☆10 · Aug 23, 2022 · Updated 3 years ago
- The Heracles framework for developing and evaluating text mining algorithms ☆10 · Jul 1, 2022 · Updated 3 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆420 · Feb 21, 2024 · Updated 2 years ago
- ☆58 · Feb 8, 2026 · Updated 4 months ago
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform ☆253 · Jan 14, 2025 · Updated last year
- Code for "A Dependency Syntactic Knowledge Augmented Interactive Architecture for End-to-End Aspect-based Sentiment Analysis" on Neurocom… ☆17 · May 19, 2021 · Updated 5 years ago
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin… ☆44 · Aug 18, 2024 · Updated last year
- Tool to make high quality text to speech (tts) corpus from audio + text books. ☆28 · Jul 31, 2025 · Updated 11 months ago
- Blueprint management and straightforward (de)serialization + validation in Flask ☆14 · Oct 10, 2018 · Updated 7 years ago