Liu-Tianchi/Golden-Gemini-for-Speaker-Verification

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Liu-Tianchi/Golden-Gemini-for-Speaker-Verification)

Liu-Tianchi / Golden-Gemini-for-Speaker-Verification

Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'

☆15

Alternatives and similar repositories for Golden-Gemini-for-Speaker-Verification

Users that are interested in Golden-Gemini-for-Speaker-Verification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zds-potato / multilingual-phonetic-sv
View on GitHub
☆10Dec 22, 2023Updated 2 years ago
msh9184 / ska-tdnn
View on GitHub
☆26Nov 2, 2022Updated 3 years ago
ductuantruong / enskd
View on GitHub
[ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
☆16Mar 20, 2024Updated 2 years ago
Liu-Tianchi / Nes2Net_ASVspoof_ITW
View on GitHub
☆60Apr 4, 2026Updated 3 months ago
tango4j / Python-Speaker-Diarization
View on GitHub
Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"
☆11Apr 6, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kjw11 / CSEnet-ASR
View on GitHub
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12Mar 14, 2025Updated last year
zyzisyz / mfa_conformer
View on GitHub
☆160Jan 9, 2023Updated 3 years ago
xi-j / Mamba-ASR
View on GitHub
ConMamba for Automatic Speech Recognition
☆106Aug 12, 2024Updated last year
ductuantruong / tcm_add
View on GitHub
[INTERSPEECH'24] Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection
☆61Dec 4, 2024Updated last year
ta012 / DTFAT
View on GitHub
[AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification
☆12Mar 10, 2025Updated last year
tarun360 / SpeakerProfiling
View on GitHub
Estimating the Age, Height, and Gender of a speaker with their speech signal.
☆15Sep 19, 2022Updated 3 years ago
wngh1187 / ExU-Net
View on GitHub
Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments
☆28Jul 24, 2023Updated 2 years ago
lcn-kul / conferencing-speech-2022
View on GitHub
Source code for LCN submission for ConferencingSpeech2022 challenge.
☆14Nov 11, 2023Updated 2 years ago
xjchenGit / awesome-audio-visual-deepfake
View on GitHub
awesome-audio-visual-robustness
☆11Jan 27, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Liu-Tianchi / Nes2Net
View on GitHub
☆109Apr 4, 2026Updated 3 months ago
speechcatcher-asr / speechcatcher-data
View on GitHub
☆11Sep 5, 2025Updated 10 months ago
Miamoto / Conformer-NTM
View on GitHub
☆16Nov 9, 2023Updated 2 years ago
YoshikiMas / madeon-asr
View on GitHub
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
☆19Dec 1, 2024Updated last year
Lhx94As / PHO-LID
View on GitHub
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
☆21Aug 24, 2023Updated 2 years ago
LeiLiLab / InfiniSST
View on GitHub
☆24May 27, 2026Updated last month
ItzJuny / AMSDF
View on GitHub
[T-IFS'24] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations
☆31Jul 31, 2024Updated last year
Purdue-M2 / AI-Synthesized-Voice-Generalization
View on GitHub
This repository is the official implementation of our paper "Improving Generalization for AI-Synthesized Voice Detection", which has been…
☆23Jan 13, 2026Updated 5 months ago
JunyiPeng00 / SLT22_MultiHead-Factorized-Attentive-Pooling
View on GitHub
An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification
☆24Sep 22, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
View on GitHub
C++ version of pyannote audio overlapped speech detection pipeline
☆13Feb 14, 2024Updated 2 years ago
media-sec-lab / BAM
View on GitHub
The pytorch implementation of BAM for Partialspoof Audio Localization.
☆33Aug 16, 2024Updated last year
TakHemlata / T-EER
View on GitHub
Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"
☆14Sep 25, 2023Updated 2 years ago
mmmmayi / ExPO
View on GitHub
official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification
☆14Mar 14, 2025Updated last year
QiShanZhang / SLSforASVspoof-2021-DF
View on GitHub
Code for paper "Audio Deepfake Detection with Self-supervised XLS-R and SLS classifier
☆70Feb 7, 2025Updated last year
yzyouzhang / Audio_Research_in_US
View on GitHub
Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…
☆27Feb 27, 2026Updated 4 months ago
DanielLin94144 / Test-time-adaptation-ASR-SUTA
View on GitHub
Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…
☆23Apr 1, 2022Updated 4 years ago
nii-yamagishilab / PartialSpoof
View on GitHub
☆62Jul 15, 2024Updated last year
liuhao-lh / SMD
View on GitHub
Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'
☆11Mar 22, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
seongmin-kye / CAP
View on GitHub
Cross attentive pooling for speaker verification (IEEE SLT, 2021)
☆12Dec 14, 2020Updated 5 years ago
chimechallenge / chime-utils
View on GitHub
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
☆26Feb 25, 2025Updated last year
cyaaronk / audio_deepfake_eval
View on GitHub
☆24Sep 11, 2025Updated 10 months ago
statusrank / A-Generic-Framework-for-Optimizing-Two-way-Partial-AUC
View on GitHub
This is an official PyTorch code for our accepted paper "When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-w…
☆15Jul 7, 2022Updated 4 years ago
MiukkaZh / MGT
View on GitHub
Learning Domain-Invariant Transformation for Speaker Verification.
☆11Jun 13, 2023Updated 3 years ago
JaesungHuh / VoxSRC2021
View on GitHub
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2021
☆18Jul 21, 2021Updated 4 years ago
YMLLG / SPEECHFAKE
View on GitHub
SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods
☆28Aug 13, 2025Updated 10 months ago