feldberlin/timething

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/feldberlin/timething)

feldberlin / timething

Timething is a library for aligning text transcripts with their audio recordings.

☆131

Alternatives and similar repositories for timething

Users that are interested in timething are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

5Hyeons / StyleTTS2-Vocos
View on GitHub
StyleTTS2 + Vocos as a Decoder
☆13Mar 24, 2025Updated last year
lumaku / ctc-segmentation
View on GitHub
Segment an audio file and obtain utterance alignments. (Python package)
☆348May 15, 2024Updated 2 years ago
gullabi / STT-align
View on GitHub
Coqui STT (🐸STT) based forced alignment tool
☆13Feb 24, 2022Updated 4 years ago
suralmasha / RuTranscript
View on GitHub
Russian phonetical transcription
☆11May 20, 2026Updated 2 months ago
echogarden-project / echogarden
View on GitHub
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, sp…
☆445Apr 21, 2026Updated 3 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
SandyPanda-MLDL / ALGAN-VC-Generated-Audio-Samples
View on GitHub
Generated Audio Samples by ALGAN-VC model are available in the folder
☆19Feb 25, 2022Updated 4 years ago
EtienneAb3d / WhisperTimeSync
View on GitHub
Synchronize Whisper's timestamps over an existing accurate transcription
☆165May 28, 2024Updated 2 years ago
readbeyond / aeneas
View on GitHub
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
☆2,852Jun 22, 2024Updated 2 years ago
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
SuperBruceJia / NLNet-IQA
View on GitHub
Non-local Modeling for Image Quality Assessment
☆13Dec 20, 2023Updated 2 years ago
tobiasrordorf / SRT-to-CSV-and-audio-split
View on GitHub
Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)
☆20Nov 14, 2019Updated 6 years ago
repodiac / german_transliterate
View on GitHub
Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…
☆39Jan 16, 2021Updated 5 years ago
mozilla / DSAlign
View on GitHub
DeepSpeech based forced alignment tool
☆239Dec 12, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
maxrmorrison / pyfoal
View on GitHub
Python forced alignment
☆95Apr 12, 2024Updated 2 years ago
pettarin / forced-alignment-tools
View on GitHub
A collection of links and notes on forced alignment tools
☆941Apr 18, 2026Updated 3 months ago
MahmoudAshraf97 / ctc-forced-aligner
View on GitHub
Text to speech alignment using CTC forced alignment
☆523Jul 12, 2026Updated last week
neonbjb / tts-scores
View on GitHub
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
☆175Dec 18, 2023Updated 2 years ago
Fcabla / whisper_subtitler
View on GitHub
Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…
☆19Mar 10, 2023Updated 3 years ago
neonbjb / ocotillo
View on GitHub
Performant and accurate speech recognition built on Pytorch
☆254May 19, 2022Updated 4 years ago
KSchouten / Heracles
View on GitHub
The Heracles framework for developing and evaluating text mining algorithms
☆10Jul 1, 2022Updated 4 years ago
seahore / PPG-GradVC
View on GitHub
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
☆45Jul 24, 2023Updated 2 years ago
Mu-Y / DiariST
View on GitHub
☆18Sep 19, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
thepowerfuldeez / rvc-trainer
View on GitHub
☆12Mar 28, 2024Updated 2 years ago
LexicalStressDetection / lexical-stress-detection
View on GitHub
Deep Learning model for lexical stress detection in spoken English
☆28Mar 17, 2020Updated 6 years ago
proycon / spacy2folia
View on GitHub
Use spaCy for NLP and output to the FoLiA XML format.
☆12Feb 27, 2024Updated 2 years ago
ydqmkkx / Respiro-en
View on GitHub
Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…
☆44Sep 18, 2024Updated last year
boris-kuz / jaxloudnorm
View on GitHub
Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
☆13Jan 29, 2025Updated last year
YuanGongND / whisper-at
View on GitHub
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …
☆421Feb 21, 2024Updated 2 years ago
spicytigermeat / LabelMakr
View on GitHub
A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin…
☆46Aug 18, 2024Updated last year
yl4579 / HiFTNet
View on GitHub
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
☆256Jan 14, 2025Updated last year
XL2248 / DREGCN
View on GitHub
Code for "A Dependency Syntactic Knowledge Augmented Interactive Architecture for End-to-End Aspect-based Sentiment Analysis" on Neurocom…
☆17May 19, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
botbahlul / Live-Subtitle-V2
View on GitHub
ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free Android Developers Speech Recognition API) then TRANSLATE (usin…
☆14May 5, 2024Updated 2 years ago
madhu1995-oss / Pronunciation-and-Fluency-evaluation-using-machne-learning-and-DeepLearning
View on GitHub
☆13Apr 9, 2021Updated 5 years ago
alphacep / awesome-speech
View on GitHub
Resources that make every language unique
☆32Updated this week
JonathanFly / faster-whisper-livestream-translator
View on GitHub
faster-whisper livestream translation, OBS noise reduction, dual language subtitles
☆82Apr 26, 2023Updated 3 years ago
IIEleven11 / Automatic-Audio-Dataset-Maker
View on GitHub
Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.
☆48Sep 15, 2025Updated 10 months ago
jianfch / stable-ts
View on GitHub
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
☆2,277May 30, 2026Updated last month
zhuole1025 / LyricWhiz
View on GitHub
[ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
☆55Nov 20, 2023Updated 2 years ago