Speaker diarization benchmark framework
☆40Jun 10, 2026Updated last week
Alternatives and similar repositories for speaker-diarization-benchmark
Users who are interested in speaker-diarization-benchmark are comparing it to the libraries listed below.
- Spot the conversation: speaker diarisation in the wild☆167Jul 26, 2022Updated 3 years ago
- Pure-PyTorch Parakeet TDT inference☆47Mar 10, 2026Updated 3 months ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 4 years ago
- On-device speaker diarization powered by deep learning☆73Jun 10, 2026Updated last week
- On-device speaker recognition engine powered by deep learning☆49Jun 10, 2026Updated last week
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆96Oct 18, 2023Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆47May 13, 2025Updated last year
- benchmark for Speech-to-Intent engines☆18Mar 27, 2026Updated 2 months ago
- ☆12Mar 15, 2026Updated 3 months ago
- Diarization scoring tools.☆267Apr 8, 2026Updated 2 months ago
- A PHP function that can convert Spanish words into phonetic transcription written with IPA phonetic symbols.☆14Jan 26, 2016Updated 10 years ago
- ☆37Jan 6, 2026Updated 5 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆119Mar 1, 2026Updated 3 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆25Mar 29, 2026Updated 2 months ago
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 3 years ago
- ☆16Jan 24, 2022Updated 4 years ago
- ☆15Sep 3, 2024Updated last year
- On-device noise suppression powered by deep learning☆90Jun 10, 2026Updated last week
- Split a video based on black periods☆28Mar 30, 2026Updated 2 months ago
- ☆15Aug 22, 2020Updated 5 years ago
- ☆40Nov 18, 2025Updated 7 months ago
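- Paraphrase corpus of multiple Chinese translations of classic works. The corpus is restricted to research and teaching use. Copyright of the texts belongs to the original authors.☆11Jul 26, 2018Updated 7 years ago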
- ☆12Feb 16, 2023Updated 3 years ago
- Lighting and Rotation Invariant Real-time Vehicle Wheel Detector based on YOLOv5☆18Aug 24, 2025Updated 9 months ago
- ☆11Mar 1, 2023Updated 3 years ago
- ☆68Feb 8, 2024Updated 2 years ago
- Interpolated Kneser-Ney smoothing with an out-of-vocabulary correction and discount estimated from training data☆13Dec 11, 2020Updated 5 years ago
- Korean ASR Corpus generated from TEDx talks☆27Jan 11, 2019Updated 7 years ago
- ☆47Jan 22, 2024Updated 2 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated last year
- Headpose estimation using OPAL (2023)☆61Oct 28, 2025Updated 7 months ago
- ☆25Jan 14, 2021Updated 5 years ago
- init☆11Sep 30, 2017Updated 8 years ago
- Text-based spam SMS classification: text preprocessing☆13Jan 11, 2016Updated 10 years ago
- ☆15Nov 26, 2023Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆262Updated this week
- Landing Page for All Things Source Separation☆38Sep 12, 2025Updated 9 months ago
- [ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling☆97Sep 28, 2025Updated 8 months ago