EtienneAb3d / WhisperTimeSyncLinks
Synchronize Whisper's timestamps over an existing accurate transcription
β153Updated last year
Alternatives and similar repositories for WhisperTimeSync
Users that are interested in WhisperTimeSync are comparing it to the libraries listed below
Sorting:
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ331Updated 8 months ago
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β246Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β220Updated 3 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.β120Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitlesβ78Updated 2 years ago
- Timething is a library for aligning text transcripts with their audio recordings.β122Updated 7 months ago
- Synchronize SRT timestamps over an existing accurate transcriptionβ33Updated 8 months ago
- A testing repo to share code and thoughts on diarisationβ55Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ95Updated last year
- web based editor for subtitles and transcriptsβ137Updated 10 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptionsβ52Updated 2 years ago
- β239Updated 3 weeks ago
- generate granular word-level captions in srt formatβ57Updated 2 years ago
- Ultimate Vocal Remover CLIβ149Updated 5 months ago
- β37Updated 2 years ago
- Batch Support for OpenAI Whisperβ94Updated last year
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)β74Updated last year
- Text to speech alignment using CTC forced alignmentβ311Updated 3 months ago
- TorToiSe fine-tuning with DLASβ225Updated 11 months ago
- ez audio transcription tool with flexible processing and post-processing optionsβ154Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ149Updated last year
- π A forced aligner intended for synchronization of narrated textβ93Updated 2 years ago
- Community framework for training tortoiseβ43Updated 2 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event β¦β395Updated last year
- openvino version of openai/whisperβ168Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.β116Updated 2 years ago
- Your one-stop solution for voice dataset creationβ120Updated last year
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ160Updated 11 months ago
- Accelerating faster-whisper single file processing by multiprocessing through parallelizationβ54Updated 2 years ago