akashmjn / tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
☆496 Updated last year
Alternatives and similar repositories for tinydiarize
Users that are interested in tinydiarize are comparing it to the libraries listed below
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆326 Updated 6 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆213 Updated 7 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines ☆417 Updated 9 months ago
- ☆520 Updated 10 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆244 Updated 2 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆384 Updated last year
- ☆483 Updated last year
- ☆291 Updated 11 months ago
- Pybind11 bindings for Whisper.cpp ☆330 Updated 5 months ago
- A Python package to build AI-powered real-time audio applications ☆1,308 Updated 3 months ago
- ☆356 Updated last year
- Streaming transcriber with whisper ☆686 Updated 2 years ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ☆817 Updated 6 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ☆352 Updated 11 months ago
- How to use OpenAI's Whisper to transcribe and diarize audio files ☆343 Updated 2 years ago
- Open source inference code for Rev's model ☆404 Updated last month
- ☆591 Updated last year
- Whisper with Medusa heads ☆838 Updated last month
- ☆1,128 Updated 3 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 Updated last year
- Python bindings for whisper.cpp ☆258 Updated last week
- Joint speech-language model - respond directly to audio! ☆368 Updated 11 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆217 Updated last month
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆308 Updated 2 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆715 Updated 5 months ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote ☆212 Updated 3 months ago
- On-device voice activity detection (VAD) powered by deep learning ☆216 Updated 3 weeks ago
- OpenVINO version of openai/whisper ☆166 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated last year
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX. ☆706 Updated last year