Synchronize Whisper's timestamps over an existing accurate transcription
☆164 · May 28, 2024 · Updated 2 years ago
Alternatives and similar repositories for WhisperTimeSync
Users who are interested in WhisperTimeSync are comparing it to the libraries listed below.
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆350 · Nov 12, 2024 · Updated last year
- SubER - Subtitle Edit Rate ☆24 · May 7, 2026 · Updated last month
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text) ☆88 · Nov 23, 2023 · Updated 2 years ago
- ☆38 · Dec 26, 2022 · Updated 3 years ago
- ☆12 · Nov 7, 2024 · Updated last year
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper ☆2,256 · May 30, 2026 · Updated 2 weeks ago
- Timething is a library for aligning text transcripts with their audio recordings. ☆132 · Dec 3, 2024 · Updated last year
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence ☆2,818 · Sep 9, 2025 · Updated 9 months ago
- ez audio transcription tool with flexible processing and post-processing options ☆168 · Feb 1, 2024 · Updated 2 years ago
- Streaming Vocos ☆31 · Jun 10, 2025 · Updated last year
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆419 · Feb 21, 2024 · Updated 2 years ago
- Russian accentuator and IPA transcriber ☆16 · Sep 10, 2024 · Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 · Mar 14, 2023 · Updated 3 years ago
- ☆46 · May 13, 2026 · Updated last month
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆957 · Jun 3, 2025 · Updated last year
- Mine from pdfs created with Mokuro2Pdf ☆10 · Dec 13, 2024 · Updated last year
- Open Source AI Benchmarking toolkit for benchmarking speech to text services ☆59 · Apr 17, 2024 · Updated 2 years ago
- Diffusion Model for Voice Conversion ☆71 · Mar 14, 2024 · Updated 2 years ago
- ☆10 · Jun 8, 2024 · Updated 2 years ago
- MCP server to expose local zotero repository to MCP clients ☆29 · Jun 4, 2025 · Updated last year
- A Python script for AI speech recognition of video or audio file using whisper, stable-ts or faster-whisper and translation of subtitle u… ☆10 · Feb 17, 2025 · Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆22 · Jan 10, 2025 · Updated last year
- ☆32 · Nov 24, 2024 · Updated last year
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology… ☆55 · Nov 4, 2022 · Updated 3 years ago
- Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts… ☆265 · Apr 19, 2026 · Updated last month
- silero-vad pytorch implement ☆36 · Nov 23, 2024 · Updated last year
- A lightweight muji-moe chatbot created by Reecho.ai. ☆13 · Oct 1, 2024 · Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-… ☆42 · Sep 18, 2024 · Updated last year
- Transcribe with ease :D ☆16 · Jun 21, 2023 · Updated 2 years ago
- A testing repo to share code and thoughts on diarisation ☆58 · Mar 26, 2024 · Updated 2 years ago
- ☆17 · Jan 31, 2023 · Updated 3 years ago
- SEGAN for bandwidth extension ☆15 · Jun 6, 2019 · Updated 7 years ago
- ☆88 · Jul 31, 2025 · Updated 10 months ago
- Based off of https://huggingface.co/spaces/Nick088/Fast-Subtitle-Maker/tree/main ☆28 · Apr 6, 2026 · Updated 2 months ago
- ☆55 · Jul 16, 2025 · Updated 11 months ago
- OpenAI Whisper Prompt Examples ☆53 · Jul 17, 2023 · Updated 2 years ago
- Overlapped Speech detection in Multi-party Conversations ☆22 · Feb 20, 2018 · Updated 8 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆22,462 · Jun 3, 2026 · Updated last week
- E2E TTS using Conditional Flow Matching (Experimental*) ☆71 · Nov 10, 2023 · Updated 2 years ago