Speech Diarization for scrum automation
☆111Jul 27, 2023Updated 2 years ago
Alternatives and similar repositories for Speaker_diarization
Users that are interested in Speaker_diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,525Feb 23, 2026Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆70Apr 22, 2026Updated last month
- ☆11Oct 25, 2021Updated 4 years ago
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- Videos Transcription and Translation with Faster Whisper and ChatGPT☆240Apr 13, 2024Updated 2 years ago
- An example FastAPI server that streams messages from Autogen using OpenAI API format☆15Jul 3, 2024Updated last year
- A toolkit for speaker diarization.☆462Apr 9, 2026Updated last month
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Apr 15, 2020Updated 6 years ago
- ONNX Inference of Pyannote Segmentation☆97Dec 23, 2024Updated last year
- Faster Whisper transcription with CTranslate2☆85Nov 29, 2023Updated 2 years ago
- ☆34Aug 26, 2025Updated 9 months ago
- LINEBot☆13Apr 7, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆17Jan 7, 2025Updated last year
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- ☆15Sep 8, 2023Updated 2 years ago
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 3 years ago
- self hosted whisper api system based on container☆64Sep 4, 2024Updated last year
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆179May 7, 2026Updated 2 weeks ago
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆45Jan 11, 2024Updated 2 years ago
- Automatically add explanations of unfamiliar words in ebooks☆15Feb 9, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Apple PodCast Transcription with OpenAI's Whisper☆347Dec 3, 2023Updated 2 years ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆26Jan 4, 2024Updated 2 years ago
- Incredibly fast Whisper-large-v3☆1,872Feb 16, 2024Updated 2 years ago
- Realtime Audio SDK for the Web — audio capture, echo cancellation (AEC), voice activity detection (VAD), and real-time encoding (Opus/PCM…☆124Dec 6, 2025Updated 5 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆22,043Apr 4, 2026Updated last month
- Chooat is an open-source project designed to provide a seamless and powerful AI chat experience.☆22Jan 15, 2025Updated last year
- Use local llama LLM or openai to chat, discuss/summarize your documents, youtube videos, and so on.☆154Dec 18, 2024Updated last year
- ☆16Feb 19, 2026Updated 3 months ago
- ☆110May 6, 2026Updated 3 weeks ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Efficient approach to speaker diarization using voice characteristics extraction☆107Jun 17, 2025Updated 11 months ago
- WhisperPlus: Faster, Smarter, and More Capable 🚀☆1,949May 4, 2026Updated 3 weeks ago
- segment anything model (SAM) infer by ncnn on Android mobile phone☆30Oct 7, 2023Updated 2 years ago
- Open source inference code for Rev's model☆437Apr 22, 2025Updated last year
- Experiments with GAN, WGAN, WGAN-GP, DC-GAN, cGAN, AC,GAN and pix2pix☆10May 28, 2019Updated 6 years ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,953May 19, 2026Updated last week
- A class for generating realistic audio (TTS) for podcasts and dialogues.☆65Dec 8, 2024Updated last year