Speaker diarization with GMM-UBM and MAP Adaptation
☆31Sep 13, 2018Updated 7 years ago
Alternatives and similar repositories for ubm_map_diarization
Users that are interested in ubm_map_diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz…☆55Jun 13, 2018Updated 7 years ago
- Implementing speaker recognition using Python (GMM-UBM)☆29Apr 20, 2018Updated 7 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- Reproduction of paper Void: A Fast and Light Voice Liveness Detection System☆19Aug 19, 2020Updated 5 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆114May 22, 2019Updated 6 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla…☆18May 3, 2015Updated 10 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- Recurrent neural network for audio noise reduction☆12Aug 18, 2022Updated 3 years ago
- Implements PLDA score computation using pretrained PLDA model for speaker diarization☆18Oct 3, 2020Updated 5 years ago
- audio cfeatures extraction tool from wav to h5features format☆19May 24, 2019Updated 6 years ago
- It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data co…☆58Oct 4, 2019Updated 6 years ago
- This repository contains the code for the paper: "DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utteranc…☆20Oct 13, 2022Updated 3 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Unsupervised Speaker Clustering & Speaker Recognition☆13Jan 7, 2019Updated 7 years ago
- Constrained Permutation Invariant Training, Speech Separation☆52Jan 24, 2021Updated 5 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆25Aug 2, 2024Updated last year
- Pytorch implementation of same-family gaussian mixture models with guardrails. Features separable parameter optimization and singularity …☆26May 31, 2025Updated 9 months ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆321Nov 11, 2020Updated 5 years ago
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago
- A TensorFlow implementation of light convolutional neural network (LCNN)☆12Dec 27, 2018Updated 7 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆14Dec 9, 2015Updated 10 years ago
- This is a implementation of kaldi-plda.☆15Jun 9, 2018Updated 7 years ago
- MobileNet trained with VoxCeleb dataset and used for voice verification☆18Oct 26, 2022Updated 3 years ago
- ☆21Apr 6, 2021Updated 4 years ago
- Cochlear.ai submission for dcase2018 task2☆15Sep 14, 2018Updated 7 years ago
- Kaldi based speaker verification☆47Jan 26, 2018Updated 8 years ago
- Tensorflow implementation of pix2pix for creating music from a voice. Vocals2Song.☆17Sep 26, 2022Updated 3 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- This my implementation of sphereface using Pytorch on MNIST☆10Apr 5, 2019Updated 6 years ago
- ☆106Mar 12, 2021Updated 5 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆33Apr 24, 2020Updated 5 years ago
- CASME II: An Improved Spontaneous Micro-Expression Database and the Baseline Evaluation☆10Oct 19, 2018Updated 7 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Apr 2, 2019Updated 6 years ago