A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.
☆15Aug 29, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-speaker-recognition-verification
Users that are interested in awesome-speaker-recognition-verification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Score calibration for speaker verification☆25Dec 13, 2019Updated 6 years ago
- ☆105Sep 2, 2021Updated 4 years ago
- Implementing VGGVox for Speaker Identification on VoxCeleb1 dataset in PyTorch.☆25Oct 15, 2020Updated 5 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆97Sep 15, 2021Updated 4 years ago
- Python code for training and testing of GMM-UBM and maximum a posterirori (MAP) adaptation based speaker verification☆20Jul 31, 2020Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Speaker Verification using Pytorch☆13May 23, 2024Updated last year
- Recognizing a speaker using Deep Learning☆11Dec 25, 2017Updated 8 years ago
- Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender☆13Oct 27, 2017Updated 8 years ago
- Text-Independent Speaker Recognition Using Gaussian Mixture Models☆12Jul 1, 2015Updated 10 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- ☆13Jan 10, 2017Updated 9 years ago
- A curated list of speaker-embedding speaker-verification, speaker-identification resources.☆52Aug 12, 2021Updated 4 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Dec 11, 2019Updated 6 years ago
- Wav2kws is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Google Speech Commands datasets V1 and V2.☆13Jun 11, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A ResNet Speaker Recognition&Verification Demo☆26Oct 19, 2021Updated 4 years ago
- Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…☆11May 29, 2016Updated 9 years ago
- Awesome Speech Dataset, including download links and a brief explanation for each resource. These datasets provide diverse and high-quali…☆26Jul 4, 2025Updated 8 months ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- Tools for downloading VoxCeleb2 dataset☆33Mar 16, 2024Updated 2 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Mar 20, 2024Updated 2 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Time-domain Audio Separation Network (IN PYTORCH)☆23Jan 28, 2019Updated 7 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Jun 28, 2015Updated 10 years ago
- Corresponding post at http://efavdb.com/gaussian-processes/☆22Dec 29, 2017Updated 8 years ago
- This is a implementation of kaldi-plda.☆15Jun 9, 2018Updated 7 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"☆55Feb 15, 2022Updated 4 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Jul 16, 2022Updated 3 years ago
- Edit mp3 tags in the terminal, tui style☆16Jun 22, 2020Updated 5 years ago
- ☆159Jan 9, 2023Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆71Dec 18, 2021Updated 4 years ago
- Speech Localization and Separation using DNNs☆21Feb 6, 2017Updated 9 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆795Apr 11, 2024Updated last year
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- ☆11Dec 31, 2019Updated 6 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago