Transcription with speaker diarization pipeline
☆100Apr 27, 2023Updated 3 years ago
Alternatives and similar repositories for speaker-transcription
Users that are interested in speaker-transcription are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Skribify is a powerful transcription and summarization tool that leverages the power of OpenAI's GPT-4 and WhisperAI to generate concise …☆12Apr 29, 2025Updated last year
- web based editor for subtitles and transcripts☆147Aug 16, 2024Updated last year
- ☆14Apr 8, 2026Updated last month
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year
- ☆14Aug 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- LLM Oracle is a GPT-4 powered tool for predicting future events. It's like a Magic 8 Ball that is able to perform basic research, calcula…☆17May 27, 2023Updated 2 years ago
- optimized wav2lip☆18Jan 6, 2024Updated 2 years ago
- ☆24May 6, 2025Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆219Oct 30, 2024Updated last year
- Code and model info on SOTA finetuned Whisper models for better transcription on Hindi language☆21Jan 15, 2025Updated last year
- DocQues answers queries on longer and multiple documents build on GPT-Index and GPT-3☆13Jan 1, 2023Updated 3 years ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆19Mar 10, 2023Updated 3 years ago
- An MCP-capable intelligent RSS feed ingestion and summarization to markdown tool.☆31Feb 4, 2026Updated 3 months ago
- A POC project to demonstrate expo-cli devtools plugins with react-native-apollo-devtools-client☆22Nov 18, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,525Feb 23, 2026Updated 3 months ago
- TalkiTo lets developers interact with AI systems through speech across multiple channels (terminal, API, phone). It can be used as both a…☆55Feb 5, 2026Updated 3 months ago
- ☆37Apr 2, 2026Updated last month
- Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan☆67Dec 5, 2023Updated 2 years ago
- MCP server for transcript processing — formatting, contextual repair & smart summarization with deep-thinking LLMs☆19Apr 7, 2026Updated last month
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Jun 1, 2024Updated last year
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated last year
- Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts…☆265Apr 19, 2026Updated last month
- Create Unmute voice embeddings☆25Nov 15, 2025Updated 6 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A Unity project with stripped Schedule I scripts + meta files and plugin reference meta files☆12Apr 1, 2026Updated last month
- Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit☆62Aug 16, 2023Updated 2 years ago
- Browser-based Voice Assistant☆43Mar 31, 2023Updated 3 years ago
- This script is an automated survey bot that conducts political discussions over phone calls. It uses Flask, Twilio's Voice API, OpenAI's …☆12Sep 21, 2023Updated 2 years ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆24Nov 25, 2023Updated 2 years ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆20Oct 13, 2025Updated 7 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆37Nov 16, 2022Updated 3 years ago
- Combining GroundingDINO, Segment Anything, ZoeDepth and Multiview Compressive Coding for 3D reconstruction to reconstruct 3D model of the…☆13May 3, 2023Updated 3 years ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper☆40Oct 27, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A custom node wrapper for Kokoro TTS for ComfyUI☆52Mar 22, 2026Updated 2 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆78Jun 19, 2025Updated 11 months ago
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆33Dec 10, 2025Updated 5 months ago
- chatterbox TTS + Voice Clone using onnx☆28Dec 31, 2025Updated 4 months ago
- Everthing related to virtual try-on. Research papers, articles, projects, code, datasets, demos, videos, books, workshops, APIs, etc.☆18Jun 19, 2024Updated last year
- DMX Light and Effects control for Elite Dangerous turns your living room into a spaceship!☆16Oct 30, 2016Updated 9 years ago
- MCP Server for Google Flights !!☆26Mar 27, 2025Updated last year