akashmjn / tinydiarize

Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens

☆503

Alternatives and similar repositories for tinydiarize

Users who are interested in tinydiarize are comparing it to the libraries listed below.

aarnphm / whispercpp
Pybind11 bindings for Whisper.cpp
☆334 Updated 7 months ago
EtienneAb3d / WhisperHallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
☆333 Updated 8 months ago
Wordcab / wordcab-transcribe
💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.
☆215 Updated 9 months ago
Majdoddin / nlp
☆487 Updated last year
shashikg / WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
☆445 Updated 11 months ago
Vaibhavs10 / fast-whisper-finetuning
☆533 Updated last year
absadiki / pywhispercpp
Python bindings for whisper.cpp
☆278 Updated last month
MiscellaneousStuff / openai-whisper-cpu
Improving transcription performance of OpenAI Whisper for CPU based deployment
☆246 Updated 2 years ago
shirayu / whispering
Streaming transcriber with whisper
☆690 Updated 2 years ago
juanmc2005 / diart
A python package to build AI-powered real-time audio applications
☆1,372 Updated 5 months ago
lablab-ai / Whisper-transcription_and_diarization-speaker-identification-
How to use OpenAI's Whisper to transcribe and diarize audio files
☆349 Updated 2 years ago
PABannier / bark.cpp
Suno AI's Bark model in C/C++ for fast text-to-speech generation
☆834 Updated 8 months ago
hedrergudene / asr-sd-pipeline
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
☆96 Updated last year
yinruiqing / pyannote-whisper
☆614 Updated last year
thomasmol / cog-whisper-diarization
Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote
☆217 Updated 5 months ago
NavodPeiris / speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…
☆222 Updated 3 months ago
huggingface / speechbox
☆359 Updated last year
YuanGongND / whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …
☆398 Updated last year
ochen1 / insanely-fast-whisper-cli
The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️
☆364 Updated last year
stlukey / whispercpp.py
Python bindings for whisper.cpp
☆241 Updated last year
tincans-ai / gazelle
Joint speech-language model - respond directly to audio!
☆371 Updated last year
Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆222 Updated last week
luweigen / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆120 Updated last year
appvoid / vosper
Real-Time Whisper Voice Recognition with vosk model feedback.
☆117 Updated 2 years ago
huggingface / diarizers
☆307 Updated last year
vasistalodagala / whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
☆322 Updated 2 years ago
nyrahealth / CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
☆786 Updated last month
saharmor / whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
☆816 Updated last year
aiola-lab / whisper-medusa
Whisper with Medusa heads
☆850 Updated 3 weeks ago
mustafaaljadery / lightning-whisper-mlx
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
☆755 Updated last year