TaoRuijie/Speaker-Recognition-Demo

A ResNet Speaker Recognition & Verification Demo

☆27

Alternatives and similar repositories for Speaker-Recognition-Demo

Users who are interested in Speaker-Recognition-Demo are comparing it to the repositories listed below.

TaoRuijie / Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
☆92 · May 29, 2023 · Updated 3 years ago
TaoRuijie / ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
☆821 · Apr 11, 2024 · Updated 2 years ago
TaoRuijie / TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
☆488 · Oct 23, 2023 · Updated 2 years ago
TaoRuijie / AVCleanse
ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆44 · Oct 31, 2022 · Updated 3 years ago
Qiqi-Dai / 3DInvNet
☆27 · Jan 21, 2026 · Updated 6 months ago
Jiang-Yidi / TS-TalkNet
INTERSPEECH 2023: Target Active Speaker Detection with Audio-visual Cues
☆61 · May 29, 2023 · Updated 3 years ago
MlWoo / sentence2pinyin
TTS front-end
☆11 · Dec 19, 2018 · Updated 7 years ago
walkoncross / voxceleb2-download-zyf
Tools for downloading VoxCeleb2 dataset
☆35 · Mar 16, 2024 · Updated 2 years ago
itmo-mbss-lab / sr_labs_book
The project is related to the development of labs for the ITMO Speaker Recognition Course.
☆16 · Jul 3, 2026 · Updated 2 weeks ago
AlekseyKorshuk / accompaniment-generator
Generate accompaniment part with chords using Evolutionary algorithm.
☆11 · May 8, 2022 · Updated 4 years ago
IVIosab / music-accompaniment-generator
An evolutionary algorithm that generates an accompaniment to a given melody that consists of triad chords while following music theory ru…
☆10 · Sep 19, 2022 · Updated 3 years ago
TaoRuijie / SEANet
Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)
☆32 · Feb 28, 2025 · Updated last year
pthang23 / Singing_Voice_Transcription
☆17 · Jan 31, 2023 · Updated 3 years ago
york135 / CTC_CE_for_AST
The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…
☆12 · Mar 25, 2025 · Updated last year
AswinKumar1 / Forced-Alignment
GSoC'16 RedHen Labs
☆11 · Aug 22, 2016 · Updated 9 years ago
zexupan / MuSE
☆42 · Nov 22, 2024 · Updated last year
zabir-nabil / awesome-speaker-recognition-verification
A curated list of awesome speaker recognition/verification papers, projects, datasets, and competitions.
☆15 · Aug 29, 2021 · Updated 4 years ago
ian-k-1217 / Fully-Generalized-Non-Local-Network
☆10 · Jun 2, 2021 · Updated 5 years ago
LaunchCodeEducation / java-web-dev-exercises
demos, exercise sets, and studios for java-web-development
☆11 · Jun 14, 2023 · Updated 3 years ago
claytonblythe / neuralMusic
An attempt at genre classification with convolutional neural networks and spectrograms
☆15 · Nov 25, 2017 · Updated 8 years ago
v-iashin / VoxCeleb
An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset
☆12 · Dec 11, 2019 · Updated 6 years ago
iamhankai / voiceMusicSeparation
Voice Music Separation competing for 6th Huawei Cup in ZJU
☆11 · Jun 2, 2015 · Updated 11 years ago
AmphionTeam / SD-Eval
[NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
☆57 · Jun 25, 2024 · Updated 2 years ago
nii-yamagishilab / Attention_Backend_for_ASV
Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances
☆50 · Oct 27, 2022 · Updated 3 years ago
clovaai / voxceleb_trainer
In defence of metric learning for speaker recognition
☆1,170 · Apr 22, 2026 · Updated 2 months ago
renesemela / masters-thesis-music-autotagging
Master's Thesis: Automatic Tagging of Musical Compositions Using Machine Learning Methods
☆17 · May 22, 2023 · Updated 3 years ago
FloretCat / CMRAN
Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020
☆33 · Nov 6, 2020 · Updated 5 years ago
Finn-Fengming / Vggvox-TensorFlow
Implementation of the VGGVox network using TensorFlow.
☆17 · Mar 20, 2026 · Updated 4 months ago
liyunlongaaa / AD-TUNING
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11 · Feb 23, 2024 · Updated 2 years ago
ZJier / PDCNet
Code of paper "Densely Connected Pyramidal Dilated Convolutional Network for Hyperspectral Image Classification"
☆10 · Jun 21, 2022 · Updated 4 years ago
lucasjinreal / textfrontend
Separately maintained Chinese TTS
☆34 · Oct 28, 2022 · Updated 3 years ago
manishpandit / speaker-recognition
Text independent speaker recognition algorithm based on CNN
☆24 · Aug 30, 2025 · Updated 10 months ago
Janie1996 / AV4SER
PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition
☆12 · Mar 20, 2022 · Updated 4 years ago
mavceleb / mavceleb_baseline
☆11 · Nov 5, 2025 · Updated 8 months ago
cuhksz-nlp / McASP
☆12 · Dec 23, 2022 · Updated 3 years ago
JusperLee / speechbrain-docs-zh-cn
SpeechBrain Chinese documentation
☆12 · Mar 20, 2021 · Updated 5 years ago
SiddGururani / Pytorch-TDNN
☆99 · Dec 20, 2017 · Updated 8 years ago
jinyeying / Awesome-Nighttime-Enhancement
Collection of recent nighttime enhancement works, including papers, codes, datasets, and metrics.
☆122 · Apr 14, 2024 · Updated 2 years ago
PeihaoChen / regnet
Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned S…
☆53 · Dec 15, 2020 · Updated 5 years ago