Synchronize Whisper's timestamps over an existing accurate transcription
☆164May 28, 2024Updated last year
Alternatives and similar repositories for WhisperTimeSync
Users that are interested in WhisperTimeSync are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆349Nov 12, 2024Updated last year
- SubER - Subtitle Edit Rate☆24Feb 19, 2026Updated last month
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆89Nov 23, 2023Updated 2 years ago
- ☆38Dec 26, 2022Updated 3 years ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆19Mar 10, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Nov 7, 2024Updated last year
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper☆2,215Oct 29, 2025Updated 5 months ago
- Timething is a library for aligning text transcripts with their audio recordings.☆130Dec 3, 2024Updated last year
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆2,794Sep 9, 2025Updated 7 months ago
- Modern fork of demucs☆22Updated this week
- ez audio transcription tool with flexible processing and post-processing options☆166Feb 1, 2024Updated 2 years ago
- Streaming Vocos☆30Jun 10, 2025Updated 10 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆412Feb 21, 2024Updated 2 years ago
- Russian accentuator and IPA transcriber☆16Sep 10, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Mar 14, 2023Updated 3 years ago
- ☆44Aug 30, 2024Updated last year
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆936Jun 3, 2025Updated 10 months ago
- Mine from pdfs created with Mokuro2Pdf☆10Dec 13, 2024Updated last year
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆59Apr 17, 2024Updated last year
- Diffusion Model for Voice Conversion☆71Mar 14, 2024Updated 2 years ago
- ☆10Jun 8, 2024Updated last year
- A Python script for AI speech recognition of video or audio file using whisper, stable-ts or faster-whisper and translation of subtitle u…☆10Feb 17, 2025Updated last year
- MCP server to expose local zotero repository to MCP clients☆26Jun 4, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆252Feb 10, 2026Updated 2 months ago
- ☆32Nov 24, 2024Updated last year
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…☆55Nov 4, 2022Updated 3 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- A lightweight muji-moe chatbot created by Reecho.ai.☆13Oct 1, 2024Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆41Sep 18, 2024Updated last year
- Transcribe with ease :D☆16Jun 21, 2023Updated 2 years ago
- A testing repo to share code and thoughts on diarisation☆57Mar 26, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆17Jan 31, 2023Updated 3 years ago
- SEGAN for bandwidth extension☆15Jun 6, 2019Updated 6 years ago
- ☆88Jul 31, 2025Updated 8 months ago
- ☆54Jul 16, 2025Updated 9 months ago
- ☆18Nov 8, 2022Updated 3 years ago
- OpenAI Whisper Prompt Examples☆52Jul 17, 2023Updated 2 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Feb 20, 2018Updated 8 years ago