lablab-ai / Whisper-transcription_and_diarization-speaker-identification-
How to use OpenAIs Whisper to transcribe and diarize audio files
☆320Updated 2 years ago
Alternatives and similar repositories for Whisper-transcription_and_diarization-speaker-identification-:
Users that are interested in Whisper-transcription_and_diarization-speaker-identification- are comparing it to the libraries listed below
- ☆470Updated last year
- ☆548Updated 8 months ago
- Real time speech to text transcription app.☆396Updated 2 years ago
- Podalize: Podcast Transcription and Analysis☆153Updated 4 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆471Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆352Updated 5 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆175Updated 3 months ago
- Transcription and diarization (speaker identification)☆30Updated last year
- Streaming transcriber with whisper☆687Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆189Updated 4 months ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆345Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆203Updated 2 months ago
- A curated list of awesome OpenAI's Whisper☆97Updated last year
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆970Updated 3 weeks ago
- Transcription with speaker diarization pipeline☆89Updated last year
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆333Updated 7 months ago
- ☆348Updated 10 months ago
- Python bindings for whisper.cpp☆210Updated this week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆108Updated last year
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆211Updated this week
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆278Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆297Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆91Updated 8 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆189Updated 3 months ago
- Pybind11 bindings for Whisper.cpp☆328Updated last month
- Project that allows one to use a microphone with OpenAI whisper.☆741Updated 6 months ago
- Live-Transcription (STT) with Whisper PoC☆167Updated 7 months ago
- ☆489Updated 6 months ago
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆805Updated 9 months ago