Transcription with speaker diarization pipeline
☆99Apr 27, 2023Updated 3 years ago
Alternatives and similar repositories for speaker-transcription
Users that are interested in speaker-transcription are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speaker diarization model☆32Apr 1, 2023Updated 3 years ago
- web based editor for subtitles and transcripts☆146Aug 16, 2024Updated last year
- ☆14Aug 19, 2024Updated last year
- How to use OpenAIs Whisper to transcribe and diarize audio files☆375Oct 12, 2022Updated 3 years ago
- LLM Oracle is a GPT-4 powered tool for predicting future events. It's like a Magic 8 Ball that is able to perform basic research, calcula…☆17May 27, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- optimized wav2lip☆18Jan 6, 2024Updated 2 years ago
- ☆24May 6, 2025Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆219Oct 30, 2024Updated last year
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆19Mar 10, 2023Updated 3 years ago
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- An MCP-capable intelligent RSS feed ingestion and summarization to markdown tool.☆29Feb 4, 2026Updated 3 months ago
- A silly and weirdly useful experiment where I attempt to encode one bit of information with a VAE☆11Dec 31, 2016Updated 9 years ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,506Feb 23, 2026Updated 2 months ago
- Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan☆67Dec 5, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- MCP server for transcript processing — formatting, contextual repair & smart summarization with deep-thinking LLMs☆19Apr 7, 2026Updated 3 weeks ago
- Connecting Alexa with ChatGPT via custom Alexa skill to have continuous conversation with memory.☆10Mar 30, 2023Updated 3 years ago
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆27Oct 13, 2025Updated 6 months ago
- Harmonic track list maker based on the Camelot key system.☆11Feb 19, 2020Updated 6 years ago
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated last year
- Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts…☆258Apr 19, 2026Updated 2 weeks ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆20Apr 20, 2026Updated 2 weeks ago
- Easier analysis of large speech corpora☆24Jun 22, 2021Updated 4 years ago
- Create Unmute voice embeddings☆25Nov 15, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Unity project with stripped Schedule I scripts + meta files and plugin reference meta files☆12Apr 1, 2026Updated last month
- Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit☆62Aug 16, 2023Updated 2 years ago
- Browser-based Voice Assistant☆43Mar 31, 2023Updated 3 years ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆24Nov 25, 2023Updated 2 years ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆20Oct 13, 2025Updated 6 months ago
- A p2p service registry☆17Feb 24, 2024Updated 2 years ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper☆40Oct 27, 2022Updated 3 years ago
- An MCP-capable intelligent Apple podcast transcription and summarization to markdown tool.☆29Feb 2, 2026Updated 3 months ago
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆32Dec 10, 2025Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- chatterbox TTS + Voice Clone using onnx☆28Dec 31, 2025Updated 4 months ago
- Everthing related to virtual try-on. Research papers, articles, projects, code, datasets, demos, videos, books, workshops, APIs, etc.☆17Jun 19, 2024Updated last year
- DMX Light and Effects control for Elite Dangerous turns your living room into a spaceship!☆16Oct 30, 2016Updated 9 years ago
- Convert Slack messages exported in their complicated JSON format to simple CSV format☆14Mar 4, 2022Updated 4 years ago
- Learn by example : Operating system☆33Apr 23, 2017Updated 9 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Mar 28, 2021Updated 5 years ago
- A chatbot that can be used by businesses to communicate with customers via Whatsapp SMS using the Twilio API and OpenAI's GPT-3 language …☆33Aug 25, 2024Updated last year