phonexiaresearch / VBx-training-recipe
☆32Mar 11, 2022Updated 3 years ago
Alternatives and similar repositories for VBx-training-recipe
Users that are interested in VBx-training-recipe are comparing it to the libraries listed below
- Variational Bayes HMM over x-vectors diarization☆283Jan 15, 2024Updated 2 years ago
- ☆91Apr 24, 2025Updated 9 months ago
- Discriminative Training of VBx Diarization☆27Sep 23, 2024Updated last year
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way☆48Apr 19, 2023Updated 2 years ago
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- Julia package for Probabilistic Canonical Correlation Analysis☆12Mar 30, 2022Updated 3 years ago
- ☆59Mar 28, 2025Updated 10 months ago
- Contains code for Deep Self-Supervised Hierarchical Clustering for Speaker Diarization☆17Dec 16, 2021Updated 4 years ago
- MagicData-RAMC Dataset and Baseline☆57Sep 13, 2022Updated 3 years ago
- Small compression utility☆38Jan 20, 2026Updated 3 weeks ago
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆126Apr 8, 2022Updated 3 years ago
- This repository will illustrate the use of some different backends on NIST SRE 2019.☆21Apr 25, 2020Updated 5 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆26Jan 11, 2022Updated 4 years ago
- neural network based speaker embedder☆25Jan 7, 2023Updated 3 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆133Jun 10, 2022Updated 3 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 6 months ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆79Oct 18, 2022Updated 3 years ago
- Diarization scoring tools.☆263Mar 28, 2023Updated 2 years ago
- Spot the conversation: speaker diarisation in the wild☆157Jul 26, 2022Updated 3 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆242Dec 16, 2025Updated 2 months ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Scripts for exporting Kaldi labeled data into TensorFlow☆12Jul 31, 2019Updated 6 years ago
- ☆37Mar 30, 2021Updated 4 years ago
- Open Source Tools for Speaker Recognition☆634Aug 5, 2024Updated last year
- ☆17Jan 26, 2021Updated 5 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- End-to-End Neural Diarization☆421Aug 30, 2021Updated 4 years ago
- This is an implementation of kaldi-plda.☆15Jun 9, 2018Updated 7 years ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- A casual and simple ChatGPT Python script that can run in a terminal (as long as you have an API). Supports the Azure API.☆21May 3, 2025Updated 9 months ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Apr 8, 2022Updated 3 years ago
- Acoustic echo cancelation (AEC) is a main algorithm in the pipeline of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 8 months ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- Dereverberation of Speech Signals Using Weighted Prediction Error☆23May 17, 2019Updated 6 years ago
- ☆53Jan 15, 2021Updated 5 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆55Jan 2, 2020Updated 6 years ago