How to use OpenAI's Whisper to transcribe and diarize audio files
☆377Oct 12, 2022Updated 3 years ago
Alternatives and similar repositories for Whisper-transcription_and_diarization-speaker-identification-
Users that are interested in Whisper-transcription_and_diarization-speaker-identification- are comparing it to the libraries listed below.
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,563Feb 23, 2026Updated 3 months ago
- ☆491Sep 10, 2025Updated 9 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆10,104Jun 6, 2026Updated last week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆22,462Jun 3, 2026Updated 2 weeks ago
- PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-po…☆17Dec 11, 2022Updated 3 years ago
- Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts…☆265Apr 19, 2026Updated last month
- Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.☆14Aug 13, 2023Updated 2 years ago
- Speech Diarization for scrum automation☆111Jul 27, 2023Updated 2 years ago
- This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews☆47Aug 6, 2024Updated last year
- The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)👇☆19May 25, 2023Updated 3 years ago
- Simple Android SDK for Publitio☆10Jan 16, 2021Updated 5 years ago
- A python package to build AI-powered real-time audio applications☆1,985Feb 12, 2025Updated last year
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆18Aug 24, 2023Updated 2 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆10Nov 20, 2023Updated 2 years ago
- Using OpenAI's Whisper to automatically generate YouTube subtitles☆1,441Jan 16, 2024Updated 2 years ago
- ☆11Feb 15, 2025Updated last year
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,686Apr 3, 2024Updated 2 years ago
- ☆12,966Oct 25, 2025Updated 7 months ago
- Accelerate Whisper tasks such as transcription by multiprocessing through parallelization☆25Oct 29, 2022Updated 3 years ago
- Run different pipelines of WhisperX - Transcription, Diarization, VAD, Alignment completely OFFLINE.☆48Mar 30, 2025Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper☆40Oct 27, 2022Updated 3 years ago
- Faster Whisper transcription with CTranslate2☆23,584Nov 19, 2025Updated 6 months ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,872Jun 1, 2026Updated 2 weeks ago
- EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy a…☆57Updated this week
- Tabs on Tallahassee☆11Dec 5, 2016Updated 9 years ago
- Batch Local Transcribe Audio/Movie To Text With Whisper AI Model. Keep Privacy Safe!☆41Nov 3, 2025Updated 7 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆102,585Apr 15, 2026Updated 2 months ago
- private repo for nonfiction drafting☆17Oct 24, 2023Updated 2 years ago
- ☆28Apr 16, 2024Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options☆168Feb 1, 2024Updated 2 years ago
- FastAPI service on top of WhisperX☆180Updated this week
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆451Jun 7, 2026Updated last week
- A light-weight framework for creating applications using LLMs☆96Jul 30, 2023Updated 2 years ago
- ☆38Dec 26, 2022Updated 3 years ago
- LocalAI website, powered by Hugo☆15Nov 22, 2023Updated 2 years ago
- ☆14Apr 2, 2024Updated 2 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆11Apr 6, 2020Updated 6 years ago
- ☆11Mar 18, 2024Updated 2 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago