A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.
☆15Aug 29, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-speaker-recognition-verification
Users that are interested in awesome-speaker-recognition-verification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Score calibration for speaker verification☆25Dec 13, 2019Updated 6 years ago
- ☆105Sep 2, 2021Updated 4 years ago
- Implementing VGGVox for Speaker Identification on VoxCeleb1 dataset in PyTorch.☆25Oct 15, 2020Updated 5 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆97Sep 15, 2021Updated 4 years ago
- Python code for training and testing of GMM-UBM and maximum a posterirori (MAP) adaptation based speaker verification☆20Jul 31, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Speaker Verification using Pytorch☆13May 23, 2024Updated last year
- Recognizing a speaker using Deep Learning☆11Dec 25, 2017Updated 8 years ago
- Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender☆13Mar 26, 2026Updated 3 weeks ago
- Text-Independent Speaker Recognition Using Gaussian Mixture Models☆12Jul 1, 2015Updated 10 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- ☆13Jan 10, 2017Updated 9 years ago
- A curated list of speaker-embedding speaker-verification, speaker-identification resources.☆52Aug 12, 2021Updated 4 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Dec 11, 2019Updated 6 years ago
- Wav2kws is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Google Speech Commands datasets V1 and V2.☆13Jun 11, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A ResNet Speaker Recognition&Verification Demo☆26Oct 19, 2021Updated 4 years ago
- Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…☆11May 29, 2016Updated 9 years ago
- Awesome Speech Dataset, including download links and a brief explanation for each resource. These datasets provide diverse and high-quali…☆26Jul 4, 2025Updated 9 months ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- Tools for downloading VoxCeleb2 dataset☆34Mar 16, 2024Updated 2 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Mar 20, 2024Updated 2 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Time-domain Audio Separation Network (IN PYTORCH)☆23Jan 28, 2019Updated 7 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Jun 28, 2015Updated 10 years ago
- Corresponding post at http://efavdb.com/gaussian-processes/☆22Dec 29, 2017Updated 8 years ago
- This is a implementation of kaldi-plda.☆15Jun 9, 2018Updated 7 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"☆55Feb 15, 2022Updated 4 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated this week
- Edit mp3 tags in the terminal, tui style☆17Jun 22, 2020Updated 5 years ago
- ☆160Jan 9, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Speech Localization and Separation using DNNs☆21Feb 6, 2017Updated 9 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆71Dec 18, 2021Updated 4 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆801Apr 11, 2024Updated 2 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- ☆11Dec 31, 2019Updated 6 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago