speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names.
☆252Feb 10, 2026Updated 2 months ago
Alternatives and similar repositories for speechlib
Users that are interested in speechlib are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,485Feb 23, 2026Updated last month
- Skribify is a powerful transcription and summarization tool that leverages the power of OpenAI's GPT-4 and WhisperAI to generate concise …☆12Apr 29, 2025Updated 11 months ago
- A testing repo to share code and thoughts on diarisation☆57Mar 26, 2024Updated 2 years ago
- ☆491Sep 10, 2025Updated 7 months ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- On-device speaker diarization powered by deep learning☆69Updated this week
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Oct 30, 2024Updated last year
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆17Aug 24, 2023Updated 2 years ago
- Whisper from OpenAi and diarization with Pyannote☆51Jan 7, 2024Updated 2 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆936Jun 3, 2025Updated 10 months ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆166Feb 1, 2024Updated 2 years ago
- Speaker diarization service☆27Feb 24, 2026Updated last month
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆21,210Apr 4, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,734Updated this week
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆174Dec 12, 2025Updated 4 months ago
- Open-source reproducible benchmarks from Argmax☆85Apr 8, 2026Updated last week
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆11Nov 6, 2024Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆35Aug 1, 2025Updated 8 months ago
- ☆666Sep 24, 2025Updated 6 months ago
- ☆325Jun 14, 2024Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆164May 28, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆349Nov 12, 2024Updated last year
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆544Nov 6, 2023Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- A collection of custom tools and extensions for Open WebUI that enhance its capabilities☆12Dec 11, 2024Updated last year
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 7 months ago
- This script is an automated survey bot that conducts political discussions over phone calls. It uses Flask, Twilio's Voice API, OpenAI's …☆12Sep 21, 2023Updated 2 years ago
- ☆358Mar 17, 2024Updated 2 years ago
- A nearly-live implementation of OpenAI's Whisper.☆3,962Mar 17, 2026Updated 3 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Docker image for WhisperX by Max Bain☆12Sep 24, 2025Updated 6 months ago
- turnkey self-hosted offline transcription and diarization service with llm summary☆923Jan 18, 2026Updated 2 months ago
- A python package to build AI-powered real-time audio applications☆1,966Feb 12, 2025Updated last year
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆445Aug 12, 2025Updated 8 months ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Sep 30, 2022Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆105Jan 10, 2025Updated last year