EtienneAb3d/WhisperTimeSync

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/EtienneAb3d/WhisperTimeSync)

EtienneAb3d / WhisperTimeSync

Synchronize Whisper's timestamps over an existing accurate transcription

☆165

Alternatives and similar repositories for WhisperTimeSync

Users that are interested in WhisperTimeSync are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EtienneAb3d / WhisperHallu
View on GitHub
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
☆350Nov 12, 2024Updated last year
fleek / VADtransciber
View on GitHub
☆38Dec 26, 2022Updated 3 years ago
EtienneAb3d / karaok-AI
View on GitHub
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
☆91Nov 23, 2023Updated 2 years ago
Fcabla / whisper_subtitler
View on GitHub
Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…
☆19Mar 10, 2023Updated 3 years ago
apptek / SubER
View on GitHub
SubER - Subtitle Edit Rate
☆26May 7, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
jianfch / stable-ts
View on GitHub
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
☆2,277May 30, 2026Updated last month
feldberlin / timething
View on GitHub
Timething is a library for aligning text transcripts with their audio recordings.
☆131Dec 3, 2024Updated last year
linto-ai / whisper-timestamped
View on GitHub
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
☆2,829Sep 9, 2025Updated 10 months ago
YuanGongND / whisper-at
View on GitHub
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …
☆421Feb 21, 2024Updated 2 years ago
pengzhendong / streaming-vocos
View on GitHub
Streaming Vocos
☆31Jun 10, 2025Updated last year
souvikg544 / TTS_Data_Maker
View on GitHub
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…
☆28Mar 14, 2023Updated 3 years ago
omogr / omogre
View on GitHub
Russian accentuator and IPA transcriber
☆17Jul 18, 2026Updated last week
geekodour / wscribe
View on GitHub
ez audio transcription tool with flexible processing and post-processing options
☆171Feb 1, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
nyrahealth / CrisperWhisper
View on GitHub
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
☆994Updated this week
audio-captioning / caption-evaluation-tools
View on GitHub
Tools for the evaluation of audio captioning.
☆19May 23, 2020Updated 6 years ago
Kartoffel0 / Memo2Anki
View on GitHub
Mine from pdfs created with Mokuro2Pdf
☆10Dec 13, 2024Updated last year
trinhtuanvubk / Diff-VC
View on GitHub
Diffusion Model for Voice Conversion
☆72Mar 14, 2024Updated 2 years ago
boun-tabi / SQuAD-TR
View on GitHub
☆11Jun 8, 2024Updated 2 years ago
NavodPeiris / speechlib
View on GitHub
Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts…
☆266Apr 19, 2026Updated 3 months ago
ml-for-speech / speechtoolkit
View on GitHub
[Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…
☆22Jan 10, 2025Updated last year
lstrgar / ss-phoneme-seg
View on GitHub
Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…
☆55Nov 4, 2022Updated 3 years ago
Peterbotliang / keras-audio-super-resolution
View on GitHub
SEGAN for bandwidth extension
☆15Jun 6, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mirix / approaches-to-diarisation
View on GitHub
A testing repo to share code and thoughts on diarisation
☆58Mar 26, 2024Updated 2 years ago
BriansIDP / WhisperBiasing
View on GitHub
☆88Jul 31, 2025Updated 11 months ago
ebu / benchmarkstt
View on GitHub
Open Source AI Benchmarking toolkit for benchmarking speech to text services
☆59Apr 17, 2024Updated 2 years ago
alphacep / whisper-prompts
View on GitHub
OpenAI Whisper Prompt Examples
☆53Jul 17, 2023Updated 3 years ago
Raffaelbdl / kuma-browser
View on GitHub
Add-on to use JPDB directly inside Anki
☆16Apr 18, 2026Updated 3 months ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
BornInWater / Overlap-Detection
View on GitHub
Overlapped Speech detection in Multi-party Conversations
☆22Feb 20, 2018Updated 8 years ago
sevengivings / subtitle-extractor
View on GitHub
A Python script for AI speech recognition of video or audio file using whisper, stable-ts or faster-whisper and translation of subtitle u…
☆10Feb 17, 2025Updated last year
Dschogo / whisperx-webui
View on GitHub
Transcribe with ease :D
☆16Jun 21, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
kaegi / alass
View on GitHub
"Automatic Language-Agnostic Subtitle Synchronization"
☆1,430Dec 28, 2023Updated 2 years ago
JonathanFly / faster-whisper-livestream-translator
View on GitHub
faster-whisper livestream translation, OBS noise reduction, dual language subtitles
☆82Apr 26, 2023Updated 3 years ago
5Hyeons / StyleTTS2-Vocos
View on GitHub
StyleTTS2 + Vocos as a Decoder
☆13Mar 24, 2025Updated last year
PecholaL / MAIN-VC
View on GitHub
Lightweight Speech Representation Learning for One-Shot Voice Conversion
☆23Dec 12, 2024Updated last year
pettarin / forced-alignment-tools
View on GitHub
A collection of links and notes on forced alignment tools
☆942Updated this week
TigreGotico / chatterbox-onnx
View on GitHub
chatterbox TTS + Voice Clone using onnx
☆28Updated this week