Synchronize Whisper's timestamps over an existing accurate transcription
☆163 · May 28, 2024 · Updated last year
Alternatives and similar repositories for WhisperTimeSync
Users that are interested in WhisperTimeSync are comparing it to the libraries listed below.
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆348 · Nov 12, 2024 · Updated last year
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text) ☆89 · Nov 23, 2023 · Updated 2 years ago
- ☆38 · Dec 26, 2022 · Updated 3 years ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models… ☆19 · Mar 10, 2023 · Updated 3 years ago
- ☆12 · Nov 7, 2024 · Updated last year
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper ☆2,186 · Oct 29, 2025 · Updated 4 months ago
- Timething is a library for aligning text transcripts with their audio recordings. ☆130 · Dec 3, 2024 · Updated last year
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence ☆2,783 · Sep 9, 2025 · Updated 6 months ago
- ez audio transcription tool with flexible processing and post-processing options ☆165 · Feb 1, 2024 · Updated 2 years ago
- Streaming Vocos ☆30 · Jun 10, 2025 · Updated 9 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆412 · Feb 21, 2024 · Updated 2 years ago
- Russian accentuator and IPA transcriber ☆16 · Sep 10, 2024 · Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 · Mar 14, 2023 · Updated 3 years ago
- Tools for the evaluation of audio captioning. ☆19 · May 23, 2020 · Updated 5 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆927 · Jun 3, 2025 · Updated 9 months ago
- ☆44 · Aug 30, 2024 · Updated last year
- Mine from pdfs created with Mokuro2Pdf ☆10 · Dec 13, 2024 · Updated last year
- Open Source AI Benchmarking toolkit for benchmarking speech to text services ☆58 · Apr 17, 2024 · Updated last year
- Diffusion Model for Voice Conversion ☆69 · Mar 14, 2024 · Updated 2 years ago
- A Python script for AI speech recognition of video or audio file using whisper, stable-ts or faster-whisper and translation of subtitle u… ☆10 · Feb 17, 2025 · Updated last year
- ☆10 · Jun 8, 2024 · Updated last year
- MCP server to expose local zotero repository to MCP clients ☆25 · Jun 4, 2025 · Updated 9 months ago
- ☆32 · Nov 24, 2024 · Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆252 · Feb 10, 2026 · Updated last month
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆22 · Jan 10, 2025 · Updated last year
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology… ☆55 · Nov 4, 2022 · Updated 3 years ago
- A lightweight muji-moe chatbot created by Reecho.ai. ☆13 · Oct 1, 2024 · Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-… ☆41 · Sep 18, 2024 · Updated last year
- silero-vad pytorch implement ☆36 · Nov 23, 2024 · Updated last year
- Transcribe with ease :D ☆16 · Jun 21, 2023 · Updated 2 years ago
- A testing repo to share code and thoughts on diarisation ☆57 · Mar 26, 2024 · Updated 2 years ago
- ☆17 · Jan 31, 2023 · Updated 3 years ago
- SEGAN for bandwidth extension ☆15 · Jun 6, 2019 · Updated 6 years ago
- ☆86 · Jul 31, 2025 · Updated 7 months ago
- ☆55 · Jul 16, 2025 · Updated 8 months ago
- ☆18 · Nov 8, 2022 · Updated 3 years ago
- ☆19 · Jul 22, 2025 · Updated 8 months ago
- OpenAI Whisper Prompt Examples ☆53 · Jul 17, 2023 · Updated 2 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆20,821 · Mar 17, 2026 · Updated last week