fedderrico/ubm_map_diarization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fedderrico/ubm_map_diarization)

fedderrico / ubm_map_diarization

Speaker diarization with GMM-UBM and MAP Adaptation

☆31

Alternatives and similar repositories for ubm_map_diarization

Users that are interested in ubm_map_diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

scelesticsiva / speaker_recognition_GMM_UBM
View on GitHub
A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz…
☆55Jun 13, 2018Updated 8 years ago
dominoanty / SpeakerRecognition
View on GitHub
Implementing speaker recognition using Python (GMM-UBM)
☆29Apr 20, 2018Updated 8 years ago
luan78zaoha / kaldi-timit-sre-ivector
View on GitHub
Develop speaker recognition model based on i-vector using TIMIT database
☆16Jul 4, 2019Updated 7 years ago
asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
Lhx94As / PHO-LID
View on GitHub
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
☆21Aug 24, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chislab / void-voice-liveness-detection
View on GitHub
Reproduction of paper Void: A Fast and Light Voice Liveness Detection System
☆19Aug 19, 2020Updated 5 years ago
david-ryan-snyder / kaldi
View on GitHub
This is now the official location of the Kaldi project.
☆10Aug 22, 2019Updated 6 years ago
skit-ai / Map-Mix
View on GitHub
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…
☆18Feb 17, 2023Updated 3 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
netankit / AudioMLProject1
View on GitHub
Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla…
☆18May 3, 2015Updated 11 years ago
joaoantoniocn / AM-SincNet
View on GitHub
The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…
☆46Oct 3, 2023Updated 2 years ago
david8862 / rnnoise
View on GitHub
Recurrent neural network for audio noise reduction
☆12Aug 18, 2022Updated 3 years ago
abhijeet3922 / Speaker-identification-using-GMMs
View on GitHub
It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data co…
☆58Oct 4, 2019Updated 6 years ago
prachiisc / PLDA_scoring
View on GitHub
Implements PLDA score computation using pretrained PLDA model for speaker diarization
☆18Oct 3, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
bootphon / features_extraction
View on GitHub
audio cfeatures extraction tool from wav to h5features format
☆19May 24, 2019Updated 7 years ago
noiseux1523 / NIST-SRE-2019
View on GitHub
Score Normalization for NIST 2019 Speaker Recognition Evaluation
☆10Nov 8, 2019Updated 6 years ago
i3thuan5 / FaNT
View on GitHub
Filtering and Noise Adding Tool
☆29May 27, 2022Updated 4 years ago
adamcsvarga / speaker-clustering
View on GitHub
Unsupervised Speaker Clustering & Speaker Recognition
☆13Jan 7, 2019Updated 7 years ago
kamperh / speech_correspondence
View on GitHub
Correspondence and autoencoder neural network training for speech using Pylearn2.
☆14Dec 9, 2015Updated 10 years ago
xuchenglin28 / speech_separation
View on GitHub
Constrained Permutation Invariant Training, Speech Separation
☆52Jan 24, 2021Updated 5 years ago
nipunmanral / Spoken-Language-Identification
View on GitHub
Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features
☆25Aug 2, 2024Updated last year
chenllliang / CTDNN
View on GitHub
MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
☆11Dec 4, 2021Updated 4 years ago
manojpamk / pytorch_xvectors
View on GitHub
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆321Nov 11, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bootphon / shennong
View on GitHub
A Python toolbox for speech features extraction
☆166Feb 8, 2023Updated 3 years ago
fangfm / lcnn
View on GitHub
A TensorFlow implementation of light convolutional neural network (LCNN)
☆12Dec 27, 2018Updated 7 years ago
kylesayrs / GMMPytorch
View on GitHub
Pytorch implementation of same-family gaussian mixture models with guardrails. Features separable parameter optimization and singularity …
☆27May 31, 2025Updated last year
vzxxbacq / PLDA
View on GitHub
This is a implementation of kaldi-plda.
☆15Jun 9, 2018Updated 8 years ago
msh9184 / contrastive-equilibrium-learning
View on GitHub
☆21Apr 6, 2021Updated 5 years ago
scarletcho / prep4kaldi
View on GitHub
Data preparation code for building Kaldi ASR system
☆14Mar 18, 2017Updated 9 years ago
finejuly / dcase2018_task2_cochlearai
View on GitHub
Cochlear.ai submission for dcase2018 task2
☆15Sep 14, 2018Updated 7 years ago
qianhwan / KaldiBasedSpeakerVerification
View on GitHub
Kaldi based speaker verification
☆47Jan 26, 2018Updated 8 years ago
AntonioAlgaida / Vocals2Song
View on GitHub
Tensorflow implementation of pix2pix for creating music from a voice. Vocals2Song.
☆17Sep 26, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
etosworld / etos-image-assessment
View on GitHub
Neural Image Assessment, a tool to automatically inspect quality of images.
☆12Mar 1, 2022Updated 4 years ago
CMsmartvoice / Unet-TTS
View on GitHub
One-shot TTS with Improved Unseen Speaker and Style Transfer
☆37Mar 2, 2022Updated 4 years ago
yogeshbalaji / Normalized-Wasserstein
View on GitHub
Normalized Wasserstein for Mixture Distributions
☆11Mar 24, 2023Updated 3 years ago
tzyll / kaldi
View on GitHub
ASR cases for speech handbook at CSLT-THU, based on Kaldi toolkit and Thchs30 database, in egs/cslt_cases.
☆107Mar 12, 2021Updated 5 years ago
jahin07 / optic-cup-disc
View on GitHub
This project contains a code written in Python to separate the optic disc and optic cup of the Retinal section of the human eye to aid in…
☆11Sep 4, 2017Updated 8 years ago
cswin / AutoRetinalImageSegmentation
View on GitHub
This code is used for joint optic disc and cup segmentation from retinal fundus images
☆12Feb 9, 2019Updated 7 years ago
yuxinhe / CASME2-Micro-Expression-Database-SVM
View on GitHub
CASME II: An Improved Spontaneous Micro-Expression Database and the Baseline Evaluation
☆10Oct 19, 2018Updated 7 years ago