EtienneAb3d / WhisperTimeSyncLinks
Synchronize Whisper's timestamps over an existing accurate transcription
β158Updated last year
Alternatives and similar repositories for WhisperTimeSync
Users that are interested in WhisperTimeSync are comparing it to the libraries listed below
Sorting:
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ345Updated 11 months ago
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β252Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.β124Updated 10 months ago
- Synchronize SRT timestamps over an existing accurate transcriptionβ35Updated 11 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β237Updated 2 months ago
- A testing repo to share code and thoughts on diarisationβ56Updated last year
- Ultimate Vocal Remover CLIβ150Updated 8 months ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)β79Updated last year
- Batch Support for OpenAI Whisperβ95Updated last year
- web based editor for subtitles and transcriptsβ141Updated last year
- openvino version of openai/whisperβ176Updated last year
- Fine Tune the Style-TTS2 Voice Modelβ254Updated 4 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitlesβ80Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.β119Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing optionsβ159Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ96Updated last year
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event β¦β404Updated last year
- generate granular word-level captions in srt formatβ57Updated 3 years ago
- Whisper combined with Silero VAD, for improved long-form transcriptionsβ53Updated 2 years ago
- Text to speech alignment using CTC forced alignmentβ371Updated 2 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.β119Updated 2 years ago
- β38Updated 2 years ago
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ158Updated last year
- Your one-stop solution for voice dataset creationβ127Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated 2 years ago
- Accelerating faster-whisper single file processing by multiprocessing through parallelizationβ55Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ151Updated last year
- β99Updated last year
- Community framework for training tortoiseβ44Updated 2 years ago
- Official Implementation of StyleTTSβ452Updated 9 months ago