zycv/Speaker-Recognition-Based-on-Deep-Learning-An-Overview

zycv / Speaker-Recognition-Based-on-Deep-Learning-An-Overview

This repo lists the reference papers of "Speaker Recognition Based on Deep Learning: An Overview".

☆41

Alternatives and similar repositories for Speaker-Recognition-Based-on-Deep-Learning-An-Overview

Users who are interested in Speaker-Recognition-Based-on-Deep-Learning-An-Overview are comparing it to the libraries listed below.

zhenghuatan / GMM-UBM_MAP_SV
Python code for training and testing GMM-UBM and maximum a posteriori (MAP) adaptation-based speaker verification
☆20 · Jul 31, 2020 · Updated 5 years ago
PunkMale / OR-Gate
Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.
☆12 · Oct 23, 2023 · Updated 2 years ago
nidwbin / AS-Norm
An implementation of adaptive score normalization (AS-Norm) for speaker verification/recognition in PyTorch
☆10 · Oct 12, 2022 · Updated 3 years ago
zycv / OpenSpeaker
OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…
☆68 · Feb 16, 2022 · Updated 4 years ago
TaoRuijie / AVCleanse
ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆44 · Oct 31, 2022 · Updated 3 years ago
qiny1012 / kaldi_x-vector_aishell
Uses the Kaldi x-vector method to train a speaker recognition model on the AISHELL database.
☆18 · Aug 19, 2021 · Updated 4 years ago
Snowdar / asv-subtools
Open source tools for speaker recognition
☆638 · Aug 5, 2024 · Updated last year
josebeo2016 / BTS-Encoder-ASVspoof
Synthetic speech detection based on Breathing-Talking-Silence sounds
☆21 · Sep 3, 2025 · Updated 10 months ago
TaoRuijie / MFV-KSD
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)
☆22 · Jul 25, 2024 · Updated 2 years ago
lawlict / ECAPA-TDNN
☆106 · Sep 2, 2021 · Updated 4 years ago
TaoRuijie / ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
☆823 · Apr 11, 2024 · Updated 2 years ago
ibliever / Cross-modal-information-fusion-for-voice-spoofing-detection
This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"
☆13 · Jun 5, 2023 · Updated 3 years ago
double22a / speech_dataset
The dataset of Speech Recognition
☆464 · Jan 4, 2026 · Updated 6 months ago
georgetz15 / mss-thesis
PyTorch implementation of MDensenet and sparse NMF. Made for my undergraduate thesis "Music Source Separation with Supervised Learning Me…
☆11 · Jan 31, 2021 · Updated 5 years ago
clovaai / voxceleb_trainer
In defence of metric learning for speaker recognition
☆1,170 · Apr 22, 2026 · Updated 3 months ago
gauthamsuresh09 / wav2vec2-large-xlsr-53-malayalam
Wav2vec2 Large XLSR 53 fine-tuned for Malayalam
☆11 · Sep 7, 2021 · Updated 4 years ago
mayank-git-hub / ETE-Speech-Recognition
Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure Python and PyTorch
☆26 · Jul 25, 2024 · Updated 2 years ago
ian-k-1217 / Fully-Generalized-Non-Local-Network
☆10 · Jun 2, 2021 · Updated 5 years ago
subham2203 / reimagined-winner
CIFAR-10 Object Detection with improved accuracy using Fractional MaxPooling with Convolutional Neural Networks
☆12 · Aug 6, 2017 · Updated 8 years ago
LCF2764 / speaker-feature-extractor
Speaker feature (voiceprint) extraction tool, based on the VGG-SR pretrained model.
☆40 · Mar 7, 2020 · Updated 6 years ago
yzyouzhang / SASV_PR
Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"
☆18 · Jun 24, 2022 · Updated 4 years ago
MiukkaZh / MGT
Learning Domain-Invariant Transformation for Speaker Verification.
☆11 · Jun 13, 2023 · Updated 3 years ago
themichaellam / tsne-d3-python
Visualize high dimensional data with t-SNE using D3 and Python
☆16 · Aug 29, 2016 · Updated 9 years ago
alumae / voxlingua107_sb
VoxLingua107 recipe for SpeechBrain
☆13 · Jul 3, 2021 · Updated 5 years ago
lightning830 / E2E-audio-speech-recognition
Conformer encoder + Transformer decoder with Hybrid CTC/attention
☆12 · Nov 11, 2021 · Updated 4 years ago
GATECH-EIC / S3-Router
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…
☆17 · Sep 19, 2023 · Updated 2 years ago
wq2012 / SpeakerRecognitionCourseChinese
☆17 · Oct 31, 2022 · Updated 3 years ago
zyzisyz / mfa_conformer
☆160 · Jan 9, 2023 · Updated 3 years ago
sasv-challenge / SASV2_Baseline
SASV2 baseline, a track on ASVspoof5 phase2 challenge
☆27 · Nov 12, 2025 · Updated 8 months ago
lixucuhk / ASV-anti-spoofing-with-Res2Net
Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.150…
☆84 · Oct 21, 2021 · Updated 4 years ago
kchan7 / WER-CER
Calculator tool for word error rate and character error rate
☆14 · Nov 3, 2020 · Updated 5 years ago
yuxi120407 / semi-supervised_tensorflow2.0
This is a TensorFlow implementation of semi-supervised learning with the following methods: Pseudo-label, Pi_model, VAT, mean_teacher, M…
☆12 · Jul 23, 2020 · Updated 6 years ago
xiaoxiaomiao323 / MSA
☆16 · Feb 19, 2026 · Updated 5 months ago
yeyupiaoling / VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud…
☆1,307 · Dec 17, 2025 · Updated 7 months ago
cuhksz-nlp / McASP
☆12 · Dec 23, 2022 · Updated 3 years ago
my-yy / sl_icmr2022
Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning", ICMR 2022
☆15 · Oct 25, 2024 · Updated last year
JusperLee / speechbrain-docs-zh-cn
SpeechBrain Chinese documentation
☆12 · Mar 20, 2021 · Updated 5 years ago
vlievin / gan-experiments-pytorch
Experiments with GAN, WGAN, WGAN-GP, DC-GAN, cGAN, AC-GAN and pix2pix
☆10 · May 28, 2019 · Updated 7 years ago
prajual / Master-Voice_Prints
This repository includes four different implementations of the Speaker Verification task including the GMM_UBM, Ivector, Deep-Speaker, an…
☆32 · Jul 3, 2018 · Updated 8 years ago