shashikg / WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
☆465 Updated last year
Alternatives and similar repositories for WhisperS2T
Users who are interested in WhisperS2T are comparing it to the libraries listed below.
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆813 Updated 3 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆218 Updated 10 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆340 Updated 10 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆233 Updated 3 weeks ago
- Python bindings for whisper.cpp ☆288 Updated last week
- ☆539 Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ☆516 Updated last year
- A python package to build AI-powered real-time audio applications ☆1,449 Updated 7 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆334 Updated 2 years ago
- ☆488 Updated 2 years ago
- ☆623 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆99 Updated 2 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆404 Updated last year
- ☆309 Updated last year
- ☆251 Updated 2 months ago
- Pybind11 bindings for Whisper.cpp ☆339 Updated 9 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆121 Updated last year
- ☆359 Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆249 Updated 2 years ago
- Joint speech-language model - respond directly to audio! ☆371 Updated last year
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and … ☆356 Updated 2 weeks ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote ☆223 Updated 6 months ago
- How to use OpenAI's Whisper to transcribe and diarize audio files ☆355 Updated 2 years ago
- Whisper with Medusa heads ☆853 Updated last month
- G2P ☆316 Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97 Updated last year
- Python bindings for whisper.cpp ☆245 Updated last year
- Text to speech alignment using CTC forced alignment ☆354 Updated 3 weeks ago
- Batch Support for OpenAI Whisper ☆95 Updated last year
- openvino version of openai/whisper ☆175 Updated last year