shashikg / WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
★455 · Updated 11 months ago
Alternatives and similar repositories for WhisperS2T
Users that are interested in WhisperS2T are comparing it to the libraries listed below
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ★217 · Updated 9 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ★798 · Updated 2 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ★336 · Updated 9 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ★225 · Updated last week
- Python bindings for whisper.cpp ★281 · Updated last week
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ★508 · Updated last year
- ★539 · Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ★325 · Updated 2 years ago
- ★619 · Updated last year
- A python package to build AI-powered real-time audio applications ★1,410 · Updated 6 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ★98 · Updated 2 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment ★247 · Updated 2 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ★402 · Updated last year
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and … ★342 · Updated last month
- ★307 · Updated last year
- ★247 · Updated 2 months ago
- Joint speech-language model - respond directly to audio! ★370 · Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation ★121 · Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ★96 · Updated last year
- Python bindings for whisper.cpp ★244 · Updated last year
- ★486 · Updated 2 years ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote ★219 · Updated 6 months ago
- Text to speech alignment using CTC forced alignment ★338 · Updated last week
- Pybind11 bindings for Whisper.cpp ★337 · Updated 8 months ago
- Whisper with Medusa heads ★852 · Updated 2 weeks ago
- openvino version of openai/whisper ★172 · Updated last year
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS ★900 · Updated 10 months ago
- G2P ★306 · Updated last week
- How to use OpenAI's Whisper to transcribe and diarize audio files ★351 · Updated 2 years ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ★368 · Updated last year