meronym/speaker-transcription

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/meronym/speaker-transcription)

meronym / speaker-transcription

Transcription with speaker diarization pipeline

☆101

Alternatives and similar repositories for speaker-transcription

Users that are interested in speaker-transcription are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

meronym / speaker-diarization
View on GitHub
Speaker diarization model
☆31Apr 1, 2023Updated 3 years ago
JFalnes / Skribify
View on GitHub
Skribify is a powerful transcription and summarization tool that leverages the power of OpenAI's GPT-4 and WhisperAI to generate concise …
☆12Apr 29, 2025Updated last year
dmse4tts / DMSE4TTS
View on GitHub
☆24May 6, 2025Updated last year
lablab-ai / Whisper-transcription_and_diarization-speaker-identification-
View on GitHub
How to use OpenAIs Whisper to transcribe and diarize audio files
☆377Oct 12, 2022Updated 3 years ago
sshh12 / llm_oracle
View on GitHub
LLM Oracle is a GPT-4 powered tool for predicting future events. It's like a Magic 8 Ball that is able to perform basic research, calcula…
☆17May 27, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Wordcab / wordcab-transcribe
View on GitHub
💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.
☆219Oct 30, 2024Updated last year
ravi03071991 / DocQues
View on GitHub
DocQues answers queries on longer and multiple documents build on GPT-Index and GPT-3
☆13Jan 1, 2023Updated 3 years ago
keon / cpp-pytorch
View on GitHub
C++ PyTorch Examples
☆10Aug 18, 2019Updated 6 years ago
Fcabla / whisper_subtitler
View on GitHub
Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…
☆19Mar 10, 2023Updated 3 years ago
OptimalFoundation / nadir
View on GitHub
Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻
☆14Jun 15, 2024Updated 2 years ago
RuiShu / one-bit-vae
View on GitHub
A silly and weirdly useful experiment where I attempt to encode one bit of information with a VAE
☆11Dec 31, 2016Updated 9 years ago
Kudo / expo-devtools-plugin-demo
View on GitHub
A POC project to demonstrate expo-cli devtools plugins with react-native-apollo-devtools-client
☆22Nov 18, 2023Updated 2 years ago
MahmoudAshraf97 / whisper-diarization
View on GitHub
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
☆5,614Feb 23, 2026Updated 5 months ago
jakubpeleska / redelex
View on GitHub
ReDeLEx is a Python framework for developing and evaluating RDL models on relational databases via RelBench and CTU datasets.
☆20May 22, 2026Updated 2 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Skippeh / ScheduleOne_UnityProject
View on GitHub
A Unity project with stripped Schedule I scripts + meta files and plugin reference meta files
☆13Apr 1, 2026Updated 3 months ago
Rumeysakeskin / Turkish-Text-to-Speech
View on GitHub
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
☆69Dec 5, 2023Updated 2 years ago
RodrigoDeRosa-zz / track-list-manager
View on GitHub
Harmonic track list maker based on the Camelot key system.
☆11Feb 19, 2020Updated 6 years ago
iiscleap / ZEST
View on GitHub
Zero-Shot Emotion Style Transfer
☆49Apr 23, 2025Updated last year
PhialsBasement / Zonos-TTS-MCP
View on GitHub
MCP server that allows Claude to have a voice.
☆14May 5, 2025Updated last year
kyegomez / Audio-xLSTMs
View on GitHub
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆20Updated this week
tbdsux / koyo
View on GitHub
Website screenshot service api on Deta Space
☆13Jun 6, 2023Updated 3 years ago
NavodPeiris / speechlib
View on GitHub
Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts…
☆266Apr 19, 2026Updated 3 months ago
pappitti / modernbert-mlx
View on GitHub
Implementation of ModernBERT in MLX
☆21Jan 7, 2026Updated 6 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
AkshitPareek / 3D-reconstruction-of-an-object-from-a-Single-Image-and-a-Text-Prompt
View on GitHub
Combining GroundingDINO, Segment Anything, ZoeDepth and Multiview Compressive Coding for 3D reconstruction to reconstruct 3D model of the…
☆13May 3, 2023Updated 3 years ago
StanGirard / speechdigest
View on GitHub
Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit
☆62Aug 16, 2023Updated 2 years ago
jacoyutorius / d3-history-timeline
View on GitHub
visualize history. Nuxt + D3
☆12Jun 20, 2018Updated 8 years ago
yrvelez / ivr_bot
View on GitHub
This script is an automated survey bot that conducts political discussions over phone calls. It uses Flask, Twilio's Voice API, OpenAI's …
☆12Sep 21, 2023Updated 2 years ago
juranki / diy-sveltekit-cdk-adapter
View on GitHub
An exercise on deploying SvelteKit with CDK
☆11Jan 21, 2022Updated 4 years ago
fisherdarling / do-proxy
View on GitHub
A library for writing type-safe Durable Objects in Rust.
☆15Nov 20, 2022Updated 3 years ago
zouyinstein / hifisr
View on GitHub
HiFi-SR is a Python-based pipeline for the detection of plant mitochondrial structural rearrangements based on the mapping of PacBio high…
☆11Jun 26, 2026Updated last month
tiero / whisperd
View on GitHub
The OpenAI Whisper speech-to-text model as a simple HTTP server
☆14Oct 26, 2023Updated 2 years ago
evahuman / EVA
View on GitHub
☆31Feb 22, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
r9y9 / MelGeneralizedCepstrums.jl
View on GitHub
Mel-Generalized Cepstrum analysis
☆19Jul 21, 2017Updated 9 years ago
remotion-dev / 4-0-trailer
View on GitHub
The intro for the Remotion 4.0 keynote and some overlays for it
☆14Jul 22, 2025Updated last year
wiredhut / wiredflow
View on GitHub
Lightweight library for creating services using just Python
☆11Aug 1, 2023Updated 2 years ago
NextBrain-ai / nbsynthetic
View on GitHub
nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasets
☆71Feb 22, 2023Updated 3 years ago
neonbjb / ocotillo
View on GitHub
Performant and accurate speech recognition built on Pytorch
☆254May 19, 2022Updated 4 years ago
pyannote / pyannote-audio
View on GitHub
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆10,351Updated this week
nateraw / voice-cloning
View on GitHub
Make Kanye sing any song ya want 🎤🔥
☆26Apr 25, 2023Updated 3 years ago