How to use OpenAIs Whisper to transcribe and diarize audio files
☆374Oct 12, 2022Updated 3 years ago
Alternatives and similar repositories for Whisper-transcription_and_diarization-speaker-identification-
Users that are interested in Whisper-transcription_and_diarization-speaker-identification- are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,443Feb 23, 2026Updated last month
- ☆666Sep 24, 2025Updated 6 months ago
- Convert a directory of .vtt or json transcripts into a fast searchable database☆19Oct 7, 2024Updated last year
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,398Mar 12, 2026Updated 2 weeks ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆20,821Mar 17, 2026Updated last week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-po…☆17Dec 11, 2022Updated 3 years ago
- Transcription with speaker diarization pipeline☆98Apr 27, 2023Updated 2 years ago
- Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.☆14Aug 13, 2023Updated 2 years ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆543Nov 6, 2023Updated 2 years ago
- Speech Diarization for scrum automation☆111Jul 27, 2023Updated 2 years ago
- ☆11Mar 31, 2023Updated 2 years ago
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews☆48Aug 6, 2024Updated last year
- Automatic Detection of Potentially Idiomatic Expressions☆12Feb 19, 2021Updated 5 years ago
- Transcription and Diarization based on OpenAI's Whisper☆25Sep 9, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This project demonstrates how to parse emails, process them using OpenAI's GPT-3.5, and load the data into a Weaviate vector database for…☆22May 3, 2023Updated 2 years ago
- The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)👇☆19May 25, 2023Updated 2 years ago
- A python package to build AI-powered real-time audio applications☆1,958Feb 12, 2025Updated last year
- Simple Android SDK for Publitio☆10Jan 16, 2021Updated 5 years ago
- Archive of political ad data from the Federal Communications Commission☆20Oct 25, 2017Updated 8 years ago
- ☆10Apr 3, 2024Updated last year
- nicar 17: advanced pdf manipulation☆18Mar 4, 2017Updated 9 years ago
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆17Aug 24, 2023Updated 2 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆11Nov 20, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆8,833Oct 25, 2025Updated 5 months ago
- Using OpenAI's Whisper to automatically generate YouTube subtitles☆1,430Jan 16, 2024Updated 2 years ago
- ☆11Feb 15, 2025Updated last year
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,688Apr 3, 2024Updated last year
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper☆40Oct 27, 2022Updated 3 years ago
- using local LLMs with Synology Nas☆14Sep 7, 2025Updated 6 months ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,853Jul 22, 2025Updated 8 months ago
- ☆16Jun 26, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Robust Speech Recognition via Large-Scale Weak Supervision☆96,288Dec 15, 2025Updated 3 months ago
- private repo for nonfiction drafting☆17Oct 24, 2023Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options☆165Feb 1, 2024Updated 2 years ago
- FastAPI service on top of WhisperX☆174Updated this week
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆423Updated this week
- ☆13Jun 29, 2024Updated last year
- Implementation of various Machine learning and MLOps applications/tutorials used within my Medium blog.☆11Jan 28, 2023Updated 3 years ago