EtienneAb3d / WhisperTimeSyncView external linksLinks
Synchronize Whisper's timestamps over an existing accurate transcription
☆161May 28, 2024Updated last year
Alternatives and similar repositories for WhisperTimeSync
Users that are interested in WhisperTimeSync are comparing it to the libraries listed below
Sorting:
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆348Nov 12, 2024Updated last year
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆87Nov 23, 2023Updated 2 years ago
- SubER - Subtitle Edit Rate☆23Updated this week
- ☆11Nov 7, 2024Updated last year
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆19Mar 10, 2023Updated 2 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆130Dec 3, 2024Updated last year
- Streaming Vocos☆29Jun 10, 2025Updated 8 months ago
- Transcription, forced alignment, and audio indexing with OpenAI's Whisper☆2,158Oct 29, 2025Updated 3 months ago
- Diffusion Model for Voice Conversion☆69Mar 14, 2024Updated last year
- Russian accentuator and IPA transcriber☆16Sep 10, 2024Updated last year
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆2,759Sep 9, 2025Updated 5 months ago
- ez audio transcription tool with flexible processing and post-processing options☆162Feb 1, 2024Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Mar 14, 2023Updated 2 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆413Feb 21, 2024Updated last year
- ☆32Nov 24, 2024Updated last year
- Phoneme segmentation using pre-trained speech models☆55Nov 4, 2022Updated 3 years ago
- ☆18Nov 8, 2022Updated 3 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Feb 20, 2018Updated 7 years ago
- A testing repo to share code and thoughts on diarisation☆57Mar 26, 2024Updated last year
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- generate granular word-level captions in srt format☆57Sep 26, 2022Updated 3 years ago
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆24Dec 12, 2024Updated last year
- ☆52Jul 16, 2025Updated 6 months ago
- This is a winter of code project aimed at speech enhancement of text to speech models.☆24Feb 6, 2022Updated 4 years ago
- RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …☆28Jan 1, 2025Updated last year
- ☆32Nov 18, 2025Updated 2 months ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Dec 15, 2022Updated 3 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- Use a video and cut out portions of it without re-mounting the video inbetween.☆15Sep 23, 2024Updated last year
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- A lightweight muji-moe chatbot created by Reecho.ai.☆12Oct 1, 2024Updated last year
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- silero-vad pytorch implement☆34Nov 23, 2024Updated last year
- Generative Adversarial Networks for different impaired speech conversions☆38Jul 6, 2023Updated 2 years ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆58Apr 17, 2024Updated last year
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year