Transcription with speaker diarization pipeline
☆98Apr 27, 2023Updated 2 years ago
Alternatives and similar repositories for speaker-transcription
Users that are interested in speaker-transcription are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speaker diarization model☆32Apr 1, 2023Updated 3 years ago
- Skribify is a powerful transcription and summarization tool that leverages the power of OpenAI's GPT-4 and WhisperAI to generate concise …☆12Apr 29, 2025Updated 11 months ago
- web based editor for subtitles and transcripts☆147Aug 16, 2024Updated last year
- ☆14Apr 8, 2026Updated last week
- ☆14Aug 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- How to use OpenAIs Whisper to transcribe and diarize audio files☆376Oct 12, 2022Updated 3 years ago
- optimized wav2lip☆18Jan 6, 2024Updated 2 years ago
- ☆24May 6, 2025Updated 11 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Oct 30, 2024Updated last year
- Code and model info on SOTA finetuned Whisper models for better transcription on Hindi language☆20Jan 15, 2025Updated last year
- DocQues answers queries on longer and multiple documents build on GPT-Index and GPT-3☆13Jan 1, 2023Updated 3 years ago
- FastAPI app that uses OpenAI APIs to stream responses☆19Jun 27, 2024Updated last year
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆19Mar 10, 2023Updated 3 years ago
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻☆14Jun 15, 2024Updated last year
- A silly and weirdly useful experiment where I attempt to encode one bit of information with a VAE☆11Dec 31, 2016Updated 9 years ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,485Feb 23, 2026Updated last month
- TalkiTo lets developers interact with AI systems through speech across multiple channels (terminal, API, phone). It can be used as both a…☆54Feb 5, 2026Updated 2 months ago
- ☆37Apr 2, 2026Updated 2 weeks ago
- Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan☆66Dec 5, 2023Updated 2 years ago
- Connecting Alexa with ChatGPT via custom Alexa skill to have continuous conversation with memory.☆10Mar 30, 2023Updated 3 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Jun 1, 2024Updated last year
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Mar 30, 2026Updated 2 weeks ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆252Feb 10, 2026Updated 2 months ago
- visualize history. Nuxt + D3☆12Jun 20, 2018Updated 7 years ago
- Create Unmute voice embeddings☆25Nov 15, 2025Updated 5 months ago
- Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit☆62Aug 16, 2023Updated 2 years ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆37Nov 16, 2022Updated 3 years ago
- A custom node wrapper for Kokoro TTS for ComfyUI☆51Mar 22, 2026Updated 3 weeks ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆77Jun 19, 2025Updated 9 months ago
- Everthing related to virtual try-on. Research papers, articles, projects, code, datasets, demos, videos, books, workshops, APIs, etc.☆17Jun 19, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆31Dec 10, 2025Updated 4 months ago
- chatterbox TTS + Voice Clone using onnx☆27Dec 31, 2025Updated 3 months ago
- Learn by example : Operating system☆33Apr 23, 2017Updated 8 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Mar 28, 2021Updated 5 years ago
- Performant and accurate speech recognition built on Pytorch☆254May 19, 2022Updated 3 years ago
- A screaming vocal samples dataset.☆13Apr 14, 2023Updated 3 years ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,734Updated this week