wq2012/SpeakerRecognitionFromScratch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wq2012/SpeakerRecognitionFromScratch)

wq2012 / SpeakerRecognitionFromScratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

☆47

Alternatives and similar repositories for SpeakerRecognitionFromScratch

Users that are interested in SpeakerRecognitionFromScratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Kevinnan-teen / Speaker-Recognition
View on GitHub
说话人识别（声纹识别）算法的Python实现。包括GMM（已完成）、GMM-UBM、ivector、基于深度学习的声纹识别（self-attention已完成）。
☆108Feb 21, 2023Updated 3 years ago
prmelehan / Speaker-Recognition
View on GitHub
Recognizing a speaker using Deep Learning
☆11Dec 25, 2017Updated 8 years ago
mycrazycracy / speaker-embedding-with-phonetic-information
View on GitHub
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆45Jul 10, 2019Updated 7 years ago
yzyouzhang / SASV_PR
View on GitHub
Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"
☆18Jun 24, 2022Updated 4 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
google / speaker-id
View on GitHub
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…
☆453Aug 12, 2025Updated 11 months ago
jymsuper / SpeakerRecognition_tutorial
View on GitHub
Simple d-vector based Speaker Recognition (verification and identification) using Pytorch
☆211Jul 17, 2020Updated 6 years ago
wq2012 / CurriculumVitae
View on GitHub
Curriculum Vitae of Quan Wang
☆15Dec 13, 2025Updated 7 months ago
ml-for-speech / speechtoolkit
View on GitHub
[Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…
☆22Jan 10, 2025Updated last year
Plachtaa / ASTRAL-quantization
View on GitHub
speaker-disentangled speech linguistic content quantizer
☆26Mar 19, 2025Updated last year
rosrad / asvspoof2017
View on GitHub
some scripts for asvspoof2017
☆11Dec 27, 2018Updated 7 years ago
liuhuang31 / Megatts2_HierSpeechpp
View on GitHub
Megatts2 use HierSpeechpp's vocoder
☆18Dec 2, 2024Updated last year
kleinzcy / speech_signal_processing
View on GitHub
☆15Jul 15, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AIwithhassan / ai-lawyer-rag-with-deepseek
View on GitHub
☆15Feb 8, 2025Updated last year
starrywongx / Semantic-Communication-for-MNIST
View on GitHub
☆27Dec 14, 2022Updated 3 years ago
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
vincentqb / audio-tutorial
View on GitHub
Experiments and tutorials with and for torchaudio
☆13May 7, 2021Updated 5 years ago
asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 4 years ago
Ming-er / Audio-Free-P-Tuning
View on GitHub
☆11Dec 28, 2023Updated 2 years ago
athena-team / DiDiSpeech
View on GitHub
☆45Oct 24, 2020Updated 5 years ago
NeuroByte-Consulting / Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs
View on GitHub
Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…
☆12Apr 28, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
giovana-morais / steme
View on GitHub
[ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation
☆13Aug 2, 2023Updated 2 years ago
cw-seo / SiReN-reco
View on GitHub
☆11Sep 29, 2022Updated 3 years ago
Jiaju-Chen / UpliftRec
View on GitHub
this is a work about UpliftRec
☆10Dec 10, 2024Updated last year
Lhx94As / PHO-LID
View on GitHub
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
☆21Aug 24, 2023Updated 2 years ago
kyegomez / SoundStream
View on GitHub
Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"
☆13Jan 27, 2025Updated last year
nikvaessen / w2v2-speaker
View on GitHub
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
☆144May 10, 2022Updated 4 years ago
WingSingFung / TISDiSS
View on GitHub
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16Oct 19, 2025Updated 9 months ago
rmarcacini / ser-coraa-pt-br
View on GitHub
Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech
☆22Mar 21, 2022Updated 4 years ago
kleberandrade / evolve-kart-unity
View on GitHub
Example of application of genetic algorithm for evolution kart navigation.
☆11Nov 21, 2019Updated 6 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
123456789asdfjkl / computer-science-in-DLUT-2-
View on GitHub
大工计算机系大二学年资料
☆10May 15, 2020Updated 6 years ago
SongZihuan / CMakeLearn
View on GitHub
CMake的实例程序
☆13Sep 27, 2021Updated 4 years ago
merlresearch / sebbs
View on GitHub
Prediction of sound event bounding boxes (SEBBs)
☆35Aug 2, 2024Updated last year
itsnotacie / AAAI-26_SepPrune
View on GitHub
SepPrune: Structured Pruning for Efficient Deep Speech Separation-AAAI'26
☆15May 31, 2025Updated last year
iiscleap / NISP-Dataset
View on GitHub
☆31Aug 9, 2022Updated 3 years ago
zknus / Graph-Diffusion-CDE
View on GitHub
Graph Neural Convection-Diffusion with Heterophily
☆11May 29, 2023Updated 3 years ago
aqtq314 / VogenSVS
View on GitHub
☆15Apr 16, 2026Updated 3 months ago