Synchronize Whisper's timestamps over an existing accurate transcription
☆164 · May 28, 2024 · Updated last year
Alternatives and similar repositories for WhisperTimeSync
Users that are interested in WhisperTimeSync are comparing it to the libraries listed below.
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆350 · Nov 12, 2024 · Updated last year
- SubER - Subtitle Edit Rate ☆24 · Feb 19, 2026 · Updated 2 months ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text) ☆88 · Nov 23, 2023 · Updated 2 years ago
- ☆38 · Dec 26, 2022 · Updated 3 years ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models… ☆19 · Mar 10, 2023 · Updated 3 years ago
- ☆12 · Nov 7, 2024 · Updated last year
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper ☆2,229 · Oct 29, 2025 · Updated 6 months ago
- Timething is a library for aligning text transcripts with their audio recordings. ☆130 · Dec 3, 2024 · Updated last year
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence ☆2,808 · Sep 9, 2025 · Updated 7 months ago
- ez audio transcription tool with flexible processing and post-processing options ☆167 · Feb 1, 2024 · Updated 2 years ago
- Streaming Vocos ☆31 · Jun 10, 2025 · Updated 10 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆414 · Feb 21, 2024 · Updated 2 years ago
- Russian accentuator and IPA transcriber ☆16 · Sep 10, 2024 · Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 · Mar 14, 2023 · Updated 3 years ago
- ☆45 · Aug 30, 2024 · Updated last year
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆942 · Jun 3, 2025 · Updated 11 months ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services ☆59 · Apr 17, 2024 · Updated 2 years ago
- Diffusion Model for Voice Conversion ☆71 · Mar 14, 2024 · Updated 2 years ago
- ☆10 · Jun 8, 2024 · Updated last year
- MCP server to expose local zotero repository to MCP clients ☆26 · Jun 4, 2025 · Updated 11 months ago
- A Python script for AI speech recognition of video or audio file using whisper, stable-ts or faster-whisper and translation of subtitle u… ☆10 · Feb 17, 2025 · Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆22 · Jan 10, 2025 · Updated last year
- Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts… ☆258 · Apr 19, 2026 · Updated 2 weeks ago
- ☆32 · Nov 24, 2024 · Updated last year
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology… ☆55 · Nov 4, 2022 · Updated 3 years ago
- silero-vad pytorch implement ☆36 · Nov 23, 2024 · Updated last year
- A lightweight muji-moe chatbot created by Reecho.ai. ☆13 · Oct 1, 2024 · Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-… ☆41 · Sep 18, 2024 · Updated last year
- Transcribe with ease :D ☆16 · Jun 21, 2023 · Updated 2 years ago
- A testing repo to share code and thoughts on diarisation ☆57 · Mar 26, 2024 · Updated 2 years ago
- SEGAN for bandwidth extension ☆15 · Jun 6, 2019 · Updated 6 years ago
- ☆88 · Jul 31, 2025 · Updated 9 months ago
- ☆54 · Jul 16, 2025 · Updated 9 months ago
- OpenAI Whisper Prompt Examples ☆53 · Jul 17, 2023 · Updated 2 years ago
- Overlapped Speech detection in Multi-party Conversations ☆22 · Feb 20, 2018 · Updated 8 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆21,615 · Apr 4, 2026 · Updated last month
- E2E TTS using Conditional Flow Matching (Experimental*) ☆71 · Nov 10, 2023 · Updated 2 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols ☆18 · Nov 19, 2025 · Updated 5 months ago
- "Automatic Language-Agnostic Subtitle Synchronization" ☆1,391 · Dec 28, 2023 · Updated 2 years ago