yucongzh / online_speaker_diarizationView external linksLinks
☆14Jul 11, 2022Updated 3 years ago
Alternatives and similar repositories for online_speaker_diarization
Users that are interested in online_speaker_diarization are comparing it to the libraries listed below
Sorting:
- ☆12Jun 14, 2022Updated 3 years ago
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- ☆18Mar 4, 2023Updated 2 years ago
- Production first, nn-based on-device signal processing toolkit.☆65May 30, 2023Updated 2 years ago
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 2 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆31Apr 29, 2022Updated 3 years ago
- ☆30Jul 21, 2022Updated 3 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- MagicData-RAMC Dataset and Baseline☆57Sep 13, 2022Updated 3 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Mar 6, 2023Updated 2 years ago
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…☆40Jun 17, 2025Updated 7 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Dec 31, 2021Updated 4 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Apr 15, 2020Updated 5 years ago
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- ☆11Jun 14, 2024Updated last year
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 3 months ago
- Docker for building an environment for Dutch online and offline ASR.☆12Feb 2, 2021Updated 5 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 7 months ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆48Jun 10, 2022Updated 3 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Sep 20, 2021Updated 4 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆45May 13, 2025Updated 9 months ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Transfer learning approach to pronunciation scoring☆11Jan 17, 2024Updated 2 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago