Transcription with speaker diarization pipeline
☆100Apr 27, 2023Updated 3 years ago
Alternatives and similar repositories for speaker-transcription
Users that are interested in speaker-transcription are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speaker diarization model☆31Apr 1, 2023Updated 3 years ago
- Skribify is a powerful transcription and summarization tool that leverages the power of OpenAI's GPT-4 and WhisperAI to generate concise …☆12Apr 29, 2025Updated last year
- web based editor for subtitles and transcripts☆147Aug 16, 2024Updated last year
- ☆15May 26, 2026Updated 3 weeks ago
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- How to use OpenAIs Whisper to transcribe and diarize audio files☆377Oct 12, 2022Updated 3 years ago
- Automatic Speech Recognition tool☆20Aug 5, 2023Updated 2 years ago
- LLM Oracle is a GPT-4 powered tool for predicting future events. It's like a Magic 8 Ball that is able to perform basic research, calcula…☆17May 27, 2023Updated 3 years ago
- optimized wav2lip☆18Jan 6, 2024Updated 2 years ago
- ☆24May 6, 2025Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆220Oct 30, 2024Updated last year
- DocQues answers queries on longer and multiple documents build on GPT-Index and GPT-3☆13Jan 1, 2023Updated 3 years ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆19Mar 10, 2023Updated 3 years ago
- A silly and weirdly useful experiment where I attempt to encode one bit of information with a VAE☆11Dec 31, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,563Feb 23, 2026Updated 3 months ago
- Audio transcription using mlx whisper and vad silence processing☆17Oct 14, 2024Updated last year
- TalkiTo lets developers interact with AI systems through speech across multiple channels (terminal, API, phone). It can be used as both a…☆55Feb 5, 2026Updated 4 months ago
- ☆37Jun 1, 2026Updated 2 weeks ago
- Harmonic track list maker based on the Camelot key system.☆11Feb 19, 2020Updated 6 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Jun 1, 2024Updated 2 years ago
- Maintained fork for Tauri update server powered by Cloudflare Workers☆34Updated this week
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated last year
- Easier analysis of large speech corpora☆24Jun 22, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆20Jun 8, 2026Updated last week
- dnstap utilities implemented in Rust☆14Jul 21, 2025Updated 10 months ago
- Browser-based Voice Assistant☆43Mar 31, 2023Updated 3 years ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆24Nov 25, 2023Updated 2 years ago
- This script is an automated survey bot that conducts political discussions over phone calls. It uses Flask, Twilio's Voice API, OpenAI's …☆12Sep 21, 2023Updated 2 years ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆20Oct 13, 2025Updated 8 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆37Nov 16, 2022Updated 3 years ago
- Create Unmute voice embeddings☆26Nov 15, 2025Updated 7 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper☆40Oct 27, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆78Jun 19, 2025Updated 11 months ago
- Real-time Google Search API for AI Agents & RAG pipelines. Get structured SERP data instantly using remote browsers.☆27Mar 9, 2026Updated 3 months ago
- An alpha playground for a web-based labeling tool for DLC☆16Jun 9, 2020Updated 6 years ago
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆33Dec 10, 2025Updated 6 months ago
- chatterbox TTS + Voice Clone using onnx☆28Dec 31, 2025Updated 5 months ago
- Colaboratory Notebook for MDX Model B☆18Apr 17, 2025Updated last year
- MCP Server for Google Flights !!☆27Mar 27, 2025Updated last year