shashikg / WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
★445 · Updated 11 months ago
Alternatives and similar repositories for WhisperS2T
Users who are interested in WhisperS2T are comparing it to the libraries listed below.
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ★786 · Updated last month
- ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ★215 · Updated 9 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ★222 · Updated 3 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ★333 · Updated 8 months ago
- Python bindings for whisper.cpp ★278 · Updated last month
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ★503 · Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ★322 · Updated 2 years ago
- ★533 · Updated last year
- ★307 · Updated last year
- ★244 · Updated last month
- A Python package to build AI-powered real-time audio applications ★1,372 · Updated 5 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment ★246 · Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction ★97 · Updated last month
- Whisper realtime streaming for long speech-to-text transcription and translation ★120 · Updated last year
- ★614 · Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ★96 · Updated last year
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …" ★398 · Updated last year
- ★359 · Updated last year
- Pybind11 bindings for Whisper.cpp ★334 · Updated 7 months ago
- Whisper with Medusa heads ★850 · Updated 3 weeks ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ★364 · Updated last year
- Joint speech-language model - respond directly to audio! ★371 · Updated last year
- Text to speech alignment using CTC forced alignment ★329 · Updated 4 months ago
- ★97 · Updated last year
- How to use OpenAI's Whisper to transcribe and diarize audio files ★349 · Updated 2 years ago
- Open source inference code for Rev's model ★415 · Updated 3 months ago
- Python bindings for whisper.cpp ★241 · Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote ★217 · Updated 5 months ago
- OpenVINO version of openai/whisper ★170 · Updated last year
- G2P ★288 · Updated 3 months ago