How to use OpenAIs Whisper to transcribe and diarize audio files
☆375Oct 12, 2022Updated 3 years ago
Alternatives and similar repositories for Whisper-transcription_and_diarization-speaker-identification-
Users that are interested in Whisper-transcription_and_diarization-speaker-identification- are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,506Feb 23, 2026Updated 2 months ago
- ☆671Sep 24, 2025Updated 7 months ago
- Convert a directory of .vtt or json transcripts into a fast searchable database☆19Oct 7, 2024Updated last year
- ☆491Sep 10, 2025Updated 7 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,877Apr 16, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Transcription and diarization (speaker identification)☆33May 31, 2023Updated 2 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆21,615Apr 4, 2026Updated last month
- PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-po…☆17Dec 11, 2022Updated 3 years ago
- Transcription with speaker diarization pipeline☆99Apr 27, 2023Updated 3 years ago
- Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts…☆258Apr 19, 2026Updated 2 weeks ago
- Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.☆14Aug 13, 2023Updated 2 years ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆547Nov 6, 2023Updated 2 years ago
- extension to download page elements and import to figma☆21Dec 12, 2020Updated 5 years ago
- Speech Diarization for scrum automation☆111Jul 27, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Mar 31, 2023Updated 3 years ago
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews☆47Aug 6, 2024Updated last year
- Transcription and Diarization based on OpenAI's Whisper☆25Sep 9, 2025Updated 7 months ago
- This project demonstrates how to parse emails, process them using OpenAI's GPT-3.5, and load the data into a Weaviate vector database for…☆22May 3, 2023Updated 3 years ago
- The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)👇☆19May 25, 2023Updated 2 years ago
- Simple Android SDK for Publitio☆10Jan 16, 2021Updated 5 years ago
- A python package to build AI-powered real-time audio applications☆1,974Feb 12, 2025Updated last year
- nicar 17: advanced pdf manipulation☆18Mar 4, 2017Updated 9 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆11Nov 20, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Using OpenAI's Whisper to automatically generate YouTube subtitles☆1,440Jan 16, 2024Updated 2 years ago
- ☆11Feb 15, 2025Updated last year
- ☆12,736Oct 25, 2025Updated 6 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,689Apr 3, 2024Updated 2 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- Dataset of ML and NLP papers☆34Aug 17, 2022Updated 3 years ago
- Run different pipelines of WhisperX - Transcription, Diarization, VAD, Alignment completely OFFLINE.☆47Mar 30, 2025Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper☆40Oct 27, 2022Updated 3 years ago
- Faster Whisper transcription with CTranslate2☆22,511Nov 19, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- using local LLMs with Synology Nas☆14Sep 7, 2025Updated 8 months ago
- EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy a…☆55Apr 17, 2026Updated 2 weeks ago
- ☆16Jun 26, 2024Updated last year
- FinRAD: Financial Readability Assessment Dataset - 13,000+ Definitions of Financial Terms for Measuring Readability☆15Nov 2, 2024Updated last year
- Tabs on Tallahassee☆11Dec 5, 2016Updated 9 years ago
- Batch Local Transcribe Audio/Movie To Text With Whisper AI Model. Keep Privacy Safe!☆41Nov 3, 2025Updated 6 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆98,662Apr 15, 2026Updated 3 weeks ago