Blair-Johnson / batch-whisper
Batch Support for OpenAI Whisper
ā91Updated last year
Alternatives and similar repositories for batch-whisper:
Users that are interested in batch-whisper are comparing it to the libraries listed below
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.ā135Updated last year
- ā350Updated 11 months ago
- š¬ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.ā205Updated 4 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeā145Updated 10 months ago
- ā274Updated 8 months ago
- Synchronize Whisper's timestamps over an existing accurate transcriptionā142Updated 9 months ago
- ā153Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesā92Updated 9 months ago
- [WIP] Scripts for fine-tuning Whisperā219Updated last year
- Zero-shot Audio Classification using Whisperā80Updated 2 years ago
- openvino version of openai/whisperā165Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisperā109Updated 2 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deploymentā239Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extractionā90Updated 10 months ago
- whisper.cpp bindings for pythonā87Updated last year
- A python package for deep multilingual punctuation prediction.ā117Updated 6 months ago
- Various speech datasets made available to the publicā113Updated 2 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsā310Updated 3 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event ā¦ā356Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPā¦ā93Updated 4 months ago
- Speaker Diarization with Transformersā64Updated 9 months ago
- ā72Updated this week
- ā35Updated 2 years ago
- ā71Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) databaseā97Updated 3 weeks ago
- A model that predicts the punctuation of English, Italian, French and German texts.ā79Updated 2 years ago
- Python bindings for whisper.cppā225Updated this week
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.ā80Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.ā110Updated last year
- š Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. š§š„š Advanced audio processing.ā234Updated 8 months ago