akashmjn / tinydiarizeLinks
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
β520Updated last year
Alternatives and similar repositories for tinydiarize
Users that are interested in tinydiarize are comparing it to the libraries listed below
Sorting:
- Pybind11 bindings for Whisper.cppβ340Updated 10 months ago
- π¬ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.β217Updated 11 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engineβ474Updated last year
- β488Updated last month
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ345Updated 11 months ago
- Streaming transcriber with whisperβ690Updated 2 years ago
- β545Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deploymentβ253Updated 2 years ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannoteβ228Updated 8 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event β¦β404Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β237Updated 2 months ago
- Joint speech-language model - respond directly to audio!β371Updated last year
- How to use OpenAIs Whisper to transcribe and diarize audio filesβ361Updated 3 years ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generationβ839Updated 11 months ago
- Python bindings for whisper.cppβ295Updated this week
- β358Updated last year
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ847Updated 4 months ago
- On-device voice activity detection (VAD) powered by deep learningβ231Updated 3 weeks ago
- A python package to build AI-powered real-time audio applicationsβ1,484Updated 8 months ago
- β632Updated 3 weeks ago
- Whisper with Medusa headsβ861Updated 2 months ago
- Python bindings for whisper.cppβ246Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.β119Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ96Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.β345Updated 2 years ago
- whisper.cpp bindings for pythonβ105Updated 2 years ago
- β310Updated last year
- openvino version of openai/whisperβ176Updated last year
- Performant and accurate speech recognition built on Pytorchβ254Updated 3 years ago
- Transcription with speaker diarization pipelineβ94Updated 2 years ago