Speaker diarization benchmark framework
☆40Jan 8, 2026Updated 3 months ago
Alternatives and similar repositories for speaker-diarization-benchmark
Users that are interested in speaker-diarization-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pure-PyTorch Parakeet TDT inference☆36Mar 10, 2026Updated last month
- Spot the conversation: speaker diarisation in the wild☆161Jul 26, 2022Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 4 years ago
- On-device speaker diarization powered by deep learning☆69Updated this week
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆94Oct 18, 2023Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆47May 13, 2025Updated 11 months ago
- ☆12Mar 15, 2026Updated last month
- Diarization scoring tools.☆263Apr 8, 2026Updated last week
- A PHP function that can convert Spanish words into phonetic transcription written with IPA phonetic symbols.☆14Jan 26, 2016Updated 10 years ago
- ☆36Jan 6, 2026Updated 3 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆116Mar 1, 2026Updated last month
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 3 years ago
- ☆13Sep 3, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆16Jan 24, 2022Updated 4 years ago
- On-device noise suppression powered by deep learning☆86Updated this week
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.☆18Apr 21, 2025Updated 11 months ago
- ☆38Nov 18, 2025Updated 4 months ago
- Lighting and Rotation Invariant Real-time Vehicle Wheel Detector based on YOLOv5☆18Aug 24, 2025Updated 7 months ago
- ☆11Mar 1, 2023Updated 3 years ago
- ☆46Jan 22, 2024Updated 2 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated 10 months ago
- Headpose estimation using OPAL (2023)☆61Oct 28, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆24Jan 14, 2021Updated 5 years ago
- [ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling☆90Sep 28, 2025Updated 6 months ago
- Script (meant to run via cron) to monitor, log, and alert when the CPU is throttled due to overheating☆12Oct 5, 2017Updated 8 years ago
- ☆54Oct 17, 2023Updated 2 years ago
- ☆92Jan 28, 2026Updated 2 months ago
- A flexible port forwarder among TCP, UNIX socket and (optionally) Tailscale, with PROXY protocol support, written in Golang.☆14Sep 24, 2024Updated last year
- Version control for all seed finding repos.☆11Jun 4, 2024Updated last year
- Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.☆13Mar 8, 2016Updated 10 years ago
- Multi Camera Face Detection and Recognition with Tracking☆18May 13, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset☆72Jan 18, 2022Updated 4 years ago
- Open source inference code for Rev's model☆435Apr 22, 2025Updated 11 months ago
- Legacy Tweak☆12Feb 6, 2023Updated 3 years ago
- Your models on any xPU☆60Updated this week
- Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation…☆32Jan 14, 2025Updated last year
- A SUS (Sliding Universal Score) parser and generator.☆10Feb 12, 2022Updated 4 years ago
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆117Jan 26, 2024Updated 2 years ago