Speaker diarization with GMM-UBM and MAP Adaptation
☆31Sep 13, 2018Updated 7 years ago
Alternatives and similar repositories for ubm_map_diarization
Users that are interested in ubm_map_diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz…☆55Jun 13, 2018Updated 8 years ago
- Implementing speaker recognition using Python (GMM-UBM)☆29Apr 20, 2018Updated 8 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Reproduction of paper Void: A Fast and Light Voice Liveness Detection System☆19Aug 19, 2020Updated 5 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆115May 22, 2019Updated 7 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla…☆18May 3, 2015Updated 11 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- Recurrent neural network for audio noise reduction☆12Aug 18, 2022Updated 3 years ago
- audio cfeatures extraction tool from wav to h5features format☆19May 24, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implements PLDA score computation using pretrained PLDA model for speaker diarization☆18Oct 3, 2020Updated 5 years ago
- It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data co…☆58Oct 4, 2019Updated 6 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- This repository contains the code for the paper: "DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utteranc…☆21Oct 13, 2022Updated 3 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 4 years ago
- Unsupervised Speaker Clustering & Speaker Recognition☆13Jan 7, 2019Updated 7 years ago
- Constrained Permutation Invariant Training, Speech Separation☆52Jan 24, 2021Updated 5 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆25Aug 2, 2024Updated last year
- Pytorch implementation of same-family gaussian mixture models with guardrails. Features separable parameter optimization and singularity …☆27May 31, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆321Nov 11, 2020Updated 5 years ago
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago
- A TensorFlow implementation of light convolutional neural network (LCNN)☆12Dec 27, 2018Updated 7 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆14Dec 9, 2015Updated 10 years ago
- Data preparation code for building Kaldi ASR system☆14Mar 18, 2017Updated 9 years ago
- This is a implementation of kaldi-plda.☆15Jun 9, 2018Updated 8 years ago
- MobileNet trained with VoxCeleb dataset and used for voice verification☆18Oct 26, 2022Updated 3 years ago
- ☆21Apr 6, 2021Updated 5 years ago
- Cochlear.ai submission for dcase2018 task2☆15Sep 14, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Neural Image Assessment, a tool to automatically inspect quality of images.☆12Mar 1, 2022Updated 4 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- ☆106Mar 12, 2021Updated 5 years ago
- This my implementation of sphereface using Pytorch on MNIST☆10Apr 5, 2019Updated 7 years ago
- This code is used for joint optic disc and cup segmentation from retinal fundus images☆12Feb 9, 2019Updated 7 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆33Apr 24, 2020Updated 6 years ago