☆87Jan 28, 2026Updated last month
Alternatives and similar repositories for DiCoW
Users that are interested in DiCoW are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆104Mar 1, 2026Updated 3 weeks ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- Discriminative Training of VBx Diarization☆27Sep 23, 2024Updated last year
- CHiME-9 Task 1 - MCoRec baseline☆27Jan 13, 2026Updated 2 months ago
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆60Feb 12, 2025Updated last year
- ☆67Feb 8, 2024Updated 2 years ago
- Error correction back-end for speaker diarization☆18Sep 26, 2023Updated 2 years ago
- A simple package for Guided source separation (GSS)☆133May 20, 2024Updated last year
- ☆39Oct 14, 2022Updated 3 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆93Oct 18, 2023Updated 2 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆105Jan 10, 2025Updated last year
- ☆19Sep 19, 2023Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆13Feb 5, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆50May 14, 2025Updated 10 months ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- ☆16Apr 24, 2025Updated 11 months ago
- A toolkit for speaker diarization.☆420Mar 4, 2026Updated 3 weeks ago
- Python package for combining diarization system outputs.☆93Oct 12, 2023Updated 2 years ago
- ☆12Mar 11, 2025Updated last year
- ☆46Jan 22, 2024Updated 2 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆79Oct 18, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆85Jun 17, 2025Updated 9 months ago
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- ☆15Apr 2, 2025Updated 11 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆39Oct 27, 2025Updated 4 months ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆35Mar 22, 2021Updated 5 years ago
- Open-source reproducible benchmarks from Argmax☆83Mar 12, 2026Updated 2 weeks ago
- Avoids race condition when acquiring GPUs in exclusive mode☆19Nov 11, 2024Updated last year
- ☆30Jul 21, 2022Updated 3 years ago
- NeMo: a toolkit for conversational AI☆13May 4, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆19Jun 5, 2023Updated 2 years ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆61Jul 1, 2025Updated 8 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- ☆17Nov 25, 2019Updated 6 years ago
- ☆21Mar 4, 2024Updated 2 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆37Feb 11, 2025Updated last year
- ☆13Jul 23, 2024Updated last year